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Preface 


The number of lectures devoted to traditional subjects such as thermodynamics 
has decreased recently in many honours degree courses owing to the com¬ 
mendable desire to introduce current research topics to undergraduates. The 
leisurely discussion of thermodynamics given in the standard undergraduate 
texts is now rather out of proportion to the time that the student is prepared 
to devote to the subject. One method of saving time is to teach a course purely 
from the atomic view of matter and in some way to ‘derive’ the laws of thermo¬ 
dynamics from the results of statistical mechanics. This approach is also 
claimed to be more likely to arouse the interest of students already familiar 
with elementary atomic physics but has the great disadvantage of presenting 
thermodynamics as a trivial and dependent subject rather than as one of the 
greatest achievements of physics. 

There is of course no doubt that much can be learnt by a judicious mixture 
of the macroscopic and microscopic approaches and so this book covers both 
thermodynamics and statistical mechanics. However, they are first introduced 
separately and then their strengths and weaknesses are further explored by 
examining a number of selected topics from both points of view. A chapter is 
also devoted to the kinetic theory of the transport properties of gases, partly 
because of the importance of the results in such fields as vacuum physics and 
partly to emphasise the great increase in difficulty associated with the study of 
systems away from thermal equilibrium. 

The main objections to the teaching of thermodynamics without the intro¬ 
duction of statistical mechanics have always seemed to me to centre on the 
tortuous approach to the second law of thermodynamics given in most elemen¬ 
tary textbooks. An engineer may find proofs based on hypothetical engines 
driving each other backwards and forwards fascinating, but to many physics 
students these seem rather special phenomena on which to base a general law 
of science. The physical need for a second law of thermodynamics is therefore 
discussed in some detail in chapters 2 and 3 and then the mathematical theorem 
of CaratWodory stated. The distinction between this theorem and the statement 
of the second law of thermodynamics in the form given by Caratheodory is 
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then carefully explained. Once the concept of an integrating factor for the first 
law of thermodynamics has been established the whole subject can be developed 
in a logical fashion and heat engines dealt with en passant. 

The wide application of the methods of thermodynamics and statistical 
mechanics in modern research is demonstrated in chapter 10 where such 
varied subjects as the 3 K stellar background radiation, Pomeranchuk cooling 
of 3 He to within a few thousandths of a degree of absolute zero, the thermo¬ 
dynamic inequalities at phase transitions and negative temperature are treated, 
and references given to fuller accounts in accessible journals. 

It is hoped that these modern examples will both arouse the interest of the 
student and impress upon him the continuing importance of a subject whose 
origins lie in the nineteenth century but which will always occupy a central 
position in physics. 

I am indebted to various colleagues for comments on parts of the manuscript 
to the copyright holders for permission to use certain figures and to Mrs. M. Crag 
for typing the manuscript. 

St Andrews P.C. RIEDI 



To the Student 


Thermal physics is the study of the properties of systems containing a large 
number of atoms. In this sense it therefore covers nearly all physics although it 
will be seen that most of this book is concerned with the special case of systems 
in thermal equilibrium. The concepts of thermal physics, perhaps more than any 
other branch of physics, are most easily grasped in detail once a certain breadth 
of experience has been obtained. This short book — the essentials of the subject 
are developed in 154 pages — is therefore designed to provide a broad foundation 
for the study of the more difficult concepts and applications of thermal physics 
and does not attempt to provide completely rigorous proofs of every point 
which is discussed. Consult the reading list at the back of the book when you 
find a point which does not seem to be dealt with to your satisfaction or when 
you wish to increase your knowledge of some aspect of the subject. The mathe¬ 
matics required for the book is revised in appendixes I - III. 

At the end of each chapter will be found a set of exercises. It is essential 
that the reader attempt these exercises — some of the results of which are used 
in the text — and is able to understand the answers given at the end of the 
book, although a fuller understanding of, say, chapter 3 may well come after 
reading chapter 7. 

The final chapter is designed to show the techniques of thermal physics at 
work. The selection of topics includes some of the most interesting recent 
developments in physics and astronomy as well as a number of more traditional 
topics which were too important to leave out. 

To summarise: try to see chapters 2-1 as a whole , work the exercises and 
study the solutions, use the book list to extend your knowledge. 
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Approximate Values for Fundamental 
Constants 


Charge on proton (e) 

Planck constant ( h ) 

Avogadro constant (N A ) 

Gas constant (7?) 

Boltzmann constant (k) 

Stefan - Boltzmann constant (a) 
Rest mass of electron 
Rest mass of proton 
Rydberg constant (R^) 

Bohr magneton 
Nuclear magneton 


1.6 x 10 -19 C 

6.6 x 10" 34 J s 
6.0 x 10 23 mol" 1 

8.3 JK" 1 mol" 1 

1.4 x 10~ 23 J K _1 

8.6 x 10” 5 eVK" 1 

5.7 x 10~ 8 W m -2 K~ 4 

9.1 x 10“ 31 kg 

1.7 x 10~ 27 kg 

1.1 x 107 m- 1 
9.3 x 10“ 24 JT" 1 
5.0 x 10“ 27 JT" 1 


A pressure of 1 atmosphere = 760 mm mercury = 1.0 x 10 s N m“ 2 = 1.0 x 10 5 Pa 
(1 mm mercury = 1 torr =133 Pa) 

An energy of 1 eV = 1.6 x 10 -19 J 

At room temperature a thermal energy kT « 300 K « 0.026 eV 
Normal temperature and pressure (N.T.P.) T = 273 K, P= 1 atmosphere 
One mole of a perfect gas occupies 22.41 (0.0224 m 3 ) at N.T.P. 



1. Introduction 


The purpose of this chapter is to revise certain concepts which the reader has 
probably already met and to define some technical terms which will be required 
in later chapters. The theme of the chapter is the essential difference between 
the methods which have been developed to handle small numbers of particles 
(mechanics) and those required for the very large number of particles (atoms or 
molecules) contained in a typical laboratory sample of a substance. 


1.1 PRELIMINARY SURVEY 

Classical mechanics is concerned with the motion of a small number of bodies, 
such as the planets of the solar system, in the absence of frictional forces. The 
energy of such a system is constant and the system is said to be conservative. 

The motion of a system of two bodies can be solved exactly, and that of three 
or more bodies by methods of successive approximation. Classical mechanics has 
been well established since the seventeenth century. 

Thermal physics is concerned with the behaviour of systems containing large 
numbers of particles and, not surprisingly, developed much later than classical 
mechanics. The full understanding of the microscopic (atomic) approach to 
thermal physics — called statistical mechanics — was not possible before the 
development of the quantum theory, but the alternative approach in terms of 
macroscopic quantities such as pressure and temperature was one of the great 
triumphs of nineteenth-century physics. 

As an example of the methods of classical mechanics consider the simple case 
of a single particle with mass m in a gravitational field in the z -direction. The 
change in the potential energy of the particle between two points z A and z B is 

U(z B ) - U(z A ) = mg(z B - z A ) (z B > z A ) (1.1) 

where g is the acceleration due to gravity. In a conservative system this increase 
in the potential energy of the particle is equal to the work done on the particle 
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W = [ ZB tngdz = mg(z B - z A ) (1.2) 

J z A 

There are two important points to notice about equation 1.1. Firstly it is only 
the external aspect of the particle which is under consideration not its atomic 
structure. Secondly, the change in the potential energy depends only on the 
coordinates of the initial and final points and not on the route (path ) taken 
between the two points. The variable z may be said to define the state of the 
system and the potential energy is called a function of state. An important 
feature of a function of state is that it is unchanged after a complete cycle , that 
is, on returning to z A from z B . Hence 

AU = 0 (cycle) (1.3) 

When a body possesses both kinetic and potential energy, the total energy is in 
general a function of six variables 

E = \m(v x 2 + v y 2 + v z 2 ) + U(x,y,z) (1.4) 

where v x is the velocity in the x-direction. The behaviour of the body as a 
function of time may be calculated if the potential energy and the initial 
coordinates of position and velocity of the body are known. 

Now a cubic centimetre of a solid contains some 10 22 atoms. A complete 
description of the behaviour of the atoms in this sample of the solid (the system 
of interest or simply the system ) would require an equation of the form of 
equation 1.4 where the energy is now a function of 6 x 10 22 variables and the 
specification of the initial conditions requires 3 x 10 22 coordinates of velocity 
and an equal number of position. It is clear that an attempt to solve this 
equation directly by the methods of classical mechanics is unlikely to be 
successful, although with the aid of fast computers it is now possible to perform 
calculations for a gas of some two hundred particles, as is discussed in the next 
section. A further complication, however, is that the motion of atomic particles 
must be treated by quantum, not classical, mechanics and then, because of the 
Heisenberg uncertainty principle, it is not possible simultaneously to specify the 
exact coordinates of position and velocity of each particle. 

The general case of a system containing a large number of particles is just as 
complicated as it appears at first, and each situation has to be treated separately. 
It is a fact of common observation, however, that if a system is in some way 
‘isolated’ from its surroundings and left undisturbed for a sufficient period of 
time (just what is meant by this is discussed in section 1.2), the properties of the 
system become independent of time. The system is then said to be in thermal 
equilibrium. 

The reason for the equilibrium state being of particular importance can be 
seen by reconsidering the case of the solid sample containing some 10 22 atoms. 
The exact behaviour of an individual atom still cannot be calculated but the 
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average properties of the whole system are now independent of time and it will 
be seen later that it is these average properties which are really of interest. The 
appearance of the word average makes it clear that we are concerned with the 
statistical properties of the system, and the microscopic study of systems in 
thermal equilibrium is called statistical mechanics. The earliest example of this 
approach in physics was the kinetic theory of gases, which is discussed later in 
this section. 

An alternative approach to a system in thermal equilibrium is to ignore the 
internal (atomic) structure of the system and to observe that it may be com¬ 
pletely specified in terms of a small number of macroscopic quantities such as 
pressure, volume and temperature. This is the approach of thermodynamics , 
one of the objectives of which is to find relationships between these measurable 
quantities which will apply to all systems in thermal equilibrium. 

The generality of the results of statistical mechanics and thermodynamics, 
applying as they do to all systems in thermal equilibrium, sometimes makes 
them difficult for the beginner to grasp, so as a concrete example a rather full 
discussion will be given, from both macroscopic and microscopic viewpoints, of 
the properties of the classical perfect gas. 

The concept of a perfect gas arose from the observation that all real gases in 
thermal equilibrium at sufficiently high temperature and low density had the 
same relationship, expressed in appropriate units, between pressure (P), 
volume ( V) and temperature. The macroscopic state of a given mass of a 
perfect gas may in fact be completely specified by any two of these three 
quantities since it will be seen that the third is then determined. A pair of 
numbers such as (P A , V A ) is therefore sufficient to specify completely the 
thermodynamic state A of the system or, as it is sometimes called, the macro¬ 
state. A change in the parameters to (P B , V B ) similarly defines the state B. The 
process of going from A to B is called a change of state. (It may be useful to 
remark here that the change of a system from solid to liquid or vapour will be 
called a change of phase.) 

Consider a fixed mass of gas under constant pressure P in a metal container. 

If the container is immersed in ice it will be found that the gas comes to thermal 
equilibrium with some volume V 0 but the volume increases to some value V 100 
if placed in steam under a pressure of one atmosphere. The state of the gas there¬ 
fore depends on its relative ‘hotness’. A container which allows the contents to 
respond to the hotness of its surroundings is said to have diathermal walls. A 
diathermal wall will be identified in the figures by simply drawing a single line. 

A quantitative measure of the relative hotness of a system is introduced by 
defining a scale of temperature as discussed fully in chapter 2. The Celsius 
scale, for example, may be defined in terms of the volume of a gas at constant 
pressure discussed above such that the temperature (0) of a body when the 
volume of the gas in contact with the body is V Q is given by the equation 



x 100 °C 
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The concept of temperature was not required in mechanics and is only 
meaningful for systems containing many particles. However, a small system 
confined within diathermal walls may be said to have a temperature (P R ) if it 
is placed in contact with a large body with temperature T R . The large body is 
called a thermal reservoir if it is sufficiently large for its temperature to be 
unchanged when the small body is placed in contact with it. 

The experimental relation between the pressure, volume and temperature of 
a perfect gas was determined over a period of many years. The measurements of 
Charles, for example, on a fixed mass of gas at constant pressure, led to the 
equation 


Vq ~ Vo [1 + a(0 — 0 O )1 CP constant) 

where V e is the volume at temperature 6, measured using say a mercury thermo¬ 
meter and a is a constant which has the same value for all real gases in the limit 
of low density and high temperature. 


T = 0 - d 0 + or 1 

Vq = olVqT (P constant) (1.4b) 

once a single fixed point has been defined. 

The triple point of water (section 2.1) is defined to be 273.16 K (degrees 
Kelvin) and then all other temperatures could be found by measuring the 
volume of the perfect gas at the triple point (V t ) and at temperature T. T is 
then found from the relationship 


T = 


273.16 


V T 


K t P 


(P constant) 


However, a gas thermometer is normally operated by measuring pressures at 
constant volume. 

Boyle’s law for a fixed mass of gas at constant temperature 


PV = constant (T constant) 


may be combined with equation 1.4b to give the equation of state 


PV = m 0 R m T 


(1.5) 


where m Q is the mass of gas and R m the gas constant for unit mass. The 
quantity 
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is found to be a constant for all gases where M is the molar mass. 

One mole of a substance is defined as the amount of the substance con¬ 
taining as many molecules as there are atoms in 0.012 kg of the isotope 12 C. 

The molar mass of a substance is the weight per mole relative to that of a mole 
of 12 C, which is taken as 12. The gas law may therefore be written 

PV = n'RT (1.6) 

where n' is the number of moles of the gas. 

According to Avogadro’s hypothesis, a mole of all substances contains the 
same number of particles (N 0 ) so the gas constant may be written 

R = N 0 k (1.7) 

where k is a new constant (the Boltzmann constant) and equation 1.6 then 
becomes 


PV = NkT (1.8) 

P = nkT (1.9) 


for a system containing N molecules, where n is the number of molecules per 
unit volume. 

Equation 1.9 (or those equivalent to it such as 1.5,1.6 and 1.8) is called the 
equation of state of the gas since it classifies all possible thermal equilibrium 
states. When one of the variables, say the volume of the gas, is changed suffi¬ 
ciently slowly for the pressure and temperature to remain constant throughout 
the gas the process is called a quasi-static change. The succession of equilibrium 
states between the initial and final states are all well defined and form a path 
which may be plotted, for example, on a diagram of pressure against volume 
(figure 1.1). A rapid variation of the volume would lead to an initial non¬ 
equilibrium state with no defined pressure or temperature. The initial and 
final states may be represented on the diagram but there is no path connecting 
them. This restricted use of the word path in thermodynamics must be 
remembered. If reversing the condition which led to the change of volume, say 
a quasi-static change of pressure from P A to P B and back to P A , leads to the 
system exactly reversing its original path the process is called a reversible 
quasi-static change. In this case the process must occur not only sufficiently 
slowly but also without friction. A closed reversible path is called a cycle. 

A further discussion of the macroscopic view of the perfect gas will be given 
later in this section, but we now wish to consider the microscopic viewpoint of 
the kinetic theory of gases. A monatomic perfect gas is defined in kinetic theory 
to be an assembly of a large number (N) of particles (atoms or molecules) in 
rapid random motion. The total volume of the particles is considered to be 
negligible relative to the volume of the container and the only interactions 
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Figure 1.1 (a) In a reversible quasi - static change from A to B (or B to A) the system is in 
thermal equilibrium at all intermediate points such as C. ACB is called a reversible path. 
(b) Immediately after a rapid change of volume from to Fg a pressure gradient exists 
in the gas so the system is not in thermal equilibrium. Finally it comes to equilibrium 
with constant pressure Pg. The states A and B are still defined but they are not joined by 
a path. 


between the particles are occasional elastic collisions. The total energy of an 
isolated volume of gas is then simply equal to the sum of the kinetic energies 
of the particles 

E = jm(v 2 

N 

r=l 

An individual particle will change its velocity after each collision but in thermal 
equilibrium there will be on average another particle somewhere in the gas 
whose velocity has just become that of the original particle (or more exactly is 
within some small range of velocities of the velocity of the original particle). The 
kinetic theory view of thermal equilibrium is therefore essentially dynamic , 
since the velocity of individual particles is constantly changing but the fraction 
of the particles within a given range of velocities is constant when the gas is in 
thermal equilibrium. 

The statistical nature of the kinetic theory should now be clear. Only the 
probability of a given particle being found within some small range of velocities 
can be calculated, but the bulk properties of the gas such as pressure depend on 
values of the velocity averaged over all the particles. 

A simple calculation leads to the equation of state 


+ v 2 2 + ... + v N 2 ) 
v 2 = constant 


PV = \Nmv 2 = \E = NkT 


(1.10) 
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V\ + v 2 2 + ... + ujv 2 2 r u r 2 

u =- = - 

N N 

The velocity of an individual atom changes after each collision but v 2 remains 
constant for a system in thermal equilibrium at temperature T. Notice that the 
energy of the perfect gas is a function only of temperature, that is, it is indepen¬ 
dent of its volume. This result is known as Joule’s law and may be written 

= 0 (perfect gas) 

The atomic approach to the perfect gas has therefore already deepened our 
understanding of the macroscopic equations and also draws attention to the 
existence of fluctuations in macroscopic quantities such as the pressure. The 
fluctuations arise purely from the discrete (atomic) nature of matter. There is 
an obvious analogy between the pressure exerted by the atoms of the gas due 
to their elastic collisions with the walls of the container and the fluctuating 
noise heard when raindrops land on a tin roof. The atoms therefore exert an 
average pressure which would be correlated with the pressure used in the 
macroscopic equations, but fluctuations must occur about this mean pressure. 
We shall see, however, that the importance of fluctuations decreases with the 
square root of the number of particles in the system and since a laboratory 
sample of a gas at N.T.P. contains some 10 20 particles the effect of fluctuations 
about the mean pressure is quite unobservable. However, near the critical point , 
where a gas liquefies, the fluctuations in the density of the system become 
sufficiently large to lead to effects such as the scattering of a light beam 
(section 7.7). 

Finally it is useful to consider the terms heat and work in terms of the 
perfect gas before a more general discussion is given in chapter 2. We have seen 
that the state of a gas contained within metal (diathermal) walls may be changed 
by placing it in contact with bodies of different degrees of hotness. The quantity 
of heat that entered the gas may be defined by the equation 



Q = m 0 C x AO (1.11) 

where m 0 is the mass of the gas, C x the specific heat when some parameter x 
(such as the volume or pressure) is kept constant and Ad is the increase in 
temperature, once a value for C x is known. The product m 0 C x is often called 
the heat or thermal capacity of the system (see section 2.6). 

The unit of quantity of heat was originally defined by taking the specific 
heat of water at constant pressure to be unity in the CGS system. The calorie 
was then defined to be: ‘the quantity of heat required to raise 1 g of water 
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through 1 ° C\ The specific heats of other systems in units of calories g -1 °C _1 
may now be found by experiment. 

A separate unit for quantity of heat is, however, no longer required since the 
very careful measurements of Joule showed that heat can be measured in the 
same units as energy (joules in the SI system) as will now be discussed. 

A system within diathermal walls has been seen to respond to changes in the 
relative hotness of its surroundings. A Dewar vessel (thermos flask), however, 
almost isolates its contents from their surroundings (figure 1.2). A wall that 
would ensure perfect isolation from the hotness of the surroundings will be 
called an adiabatic wall. The state of a system within adiabatic walls can, how¬ 
ever, be changed (as seen in figure 1.2), by performing work on it. 


I 



(a) (b) 


Figure 1.2 (a) The state of a system within adiabatic walls cannot be changed by external 
influences such as a bunsen burner, but (b) may be changed by performing work on the 
system. Notice the convention of shading to indicate an adiabatic wall. A diathermal 
wall will be shown by a single line. 


Joule measured the increase in the temperature of a mass of water due to 
the performance of both mechanical and electrical work on it in an arrangement 
similar to that shown in figure 1.2 and concluded that one Joule of energy 
produced an increase in temperature identical to that due to a transfer of 0.23 
calories of heat. The calorie therefore becomes a redundant unit and will not 
be used further in this book. 

It will be seen in chapter 2 that the effect of either a transfer of a quantity 
of heat to a system, or the performance of work on it, is to increase the internal 
energy of the system. The internal energy is a function of the thermodynamic 
state of the system and is therefore unchanged after a complete cycle, as was 
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the mechanical potential energy discussed earlier, but this is not true in general 
of quantity of heat or work. 

The effect at the atomic level of increasing the hotness of a gas is to increase 
the mean square velocity of the molecules, as may be seen from equation 1.10. 
The mean velocity of the molecules is zero, however, since there is no preferred 
direction in space [remember that velocity is a vector quantity involving both 
magnitude (speed) and direction]. The addition of a quantity of heat therefore 
leads to an increase in the random motion of the molecules. A gas performs 
mechanical work , however, by increasing its volume against a pressure P. In the 
case of a reversible process it will be seen in chapter 2 that if the piston moves 
from x A to xb 

W = f * B PA dx =- f Vs PdV (1.12) 

J x A JV A 


where A is the cross-sectional area of the cylinder containing the gas. The 
performance of mechanical work therefore requires the displacement of the 
piston in one definite direction. This may be called ordered movement. 

The conversion into useful work of the energy transferred to a system by 
increasing its hotness therefore involves the exchange of energy in the form of 
random (disordered) motion of the molecules for an ordered displacement in 
one direction. When the relation between heat and work is expressed in this 
fashion it is not surprising that the continuous conversion of heat into useful 
work in an engine cannot, even in a perfect engine, be achieved with 100 per 
cent efficiency. This powerful theorem, discussed in chapter 3, was, however, 
first established by macroscopic arguments and is one way of stating the second 
law of thermodynamics. 

The discussion of the perfect gas given above should have given some idea of 
the two different approaches to the study of systems in thermal equilibrium. A 
rather fuller discussion of the concept of thermal equilibrium is given in the 
next two sections before beginning the general treatment of thermal physics in 
chapter 2. 


1.2 THERMAL EQUILIBRIUM 

When the properties of a thermally isolated system become independent of 
time the system is said to be in thermal equilibrium. Consider the experimental 
system and thermal reservoirs of figure 1.3. Initially the reservoirs are at tem¬ 
peratures Tbi and 7 r 2 . The system and reservoir will now exchange heat — a 
complicated time-dependent process — until finally they come to a common 
temperature and remain at that temperature. The approach to equilibrium 
would depend on the nature of the experimental system but the properties of 
all systems in thermal equilibrium are contained in the laws of thermodynamics. 
The simplicity and the universal character of the thermal equilibrium state 
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Figure 1.3 The thermal reservoirs are initially at temperature 7 ri and Tr 2 - The temperature 
along the metal rod will quickly become independent of time, varying from at one 
end to 7 r 2 at the other. The rod is said to be in a steady state and the heat flow along 
the rod is governed by its thermal conductivity. Finally the whole system of rod and 
reservoirs must come to a common temperature. Only then is the system in thermal 
equilibrium. 


makes it of great theoretical importance although it is clear that only carefully 
prepared experiments are in general in thermal equilibrium with their 
surroundings. 

The equilibrium state should be distinguished from the ‘steady state’ often 
considered in the study of transport properties such as thermal conductivity. 

In figure 1.3 the thermal reservoirs are at temperatures T Rl and T R2 and the 
temperature along the connecting metal rod will quickly become independent 
of time. The rod is now in a steady state — the simplest type of transport 
situation — but the only possible equilibrium state for the isolated system of 
reservoirs and rod is when r Rl is equal to T R2 • 

This book is, apart from chapter 9, entirely devoted to systems in thermal 
equilibrium and it will simply be assumed that such an equilibrium state is in 
fact attainable. One example will be given below, however, of the passage of a 
particularly simple system from an arbitrary initial state to a final equilibrium 
state. The time required for such a transition is called a relaxation time. The 
relaxation times for different systems range from small fractions of a second 
to times too long to be measured in normal experiments. Systems with very 
long relaxation times are said to be in metastable states and, from the macro¬ 
scopic point of view, the metastable state can often be treated as if it were a 
true equilibrium state. 

The simplest system for which the approach to equilibrium can be treated 
in detail is that of a gas of hard spheres. The reasons why real gas molecules can 
sometimes be treated as classical hard elastic spheres, rather than as scattering 
potentials in a wave mechanical calculation, are discussed in section 9.1. The 
full equations of motion of a set of a hundred such spheres can be solved using 
a high-speed computer and the evolution from an arbitrary initial state to 
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equilibrium followed in detail. 

Suppose, for example, that initially all the spheres have been given the same 
speed v and therefore kinetic energy \ mv 2 . The spheres exchange energy by 
making elastic collisions with each other and finally reach the equilibrium dis¬ 
tribution of energies which is independent of time. The energy distribution is 
found to approximate to the equilibrium (Maxwell) distribution after about two 
collisions per molecule (figure 1.4). In a gas at N.T.P. this time is « 10“ 9 s, 
much less than the time taken to perform a change of volume in a normal 
laboratory experiment, but other non-equilibrium states such as a variation in 
density through the gas would take much longer to decay. Notice in figure 1.4 
that fluctuations continue to exist after the system has reached thermal equili¬ 
brium. 



Figure 1.4 A gas of spheres all of which initially have the same speed are computed to come 
to the thermal equilibrium distribution (dotted line) after about two collisions per 
sphere. The gas then performs small random fluctuations about the equilibrium value. 
The function H is defined in chapter 6. 


When the relaxation times of different parts of a system are very different it 
is sometimes possible to observe thermal equilibrium within each part separately 
The nuclei of the atoms in an insulating solid, for example, may be brought to 
any common temperature (T s ) without affecting the temperature of the rest of 
the solid (7J) (section 10.5) because the inter-nuclear relaxation time is much 
shorter than the nuclear - lattice relaxation time (ri). When the solid is thermally 
isolated the nuclear temperature will change towards 7j at a rate determined by 
T\ until a common temperature is reached. 
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1.3 THERMAL EQUILIBRIUM FOR QUANTISED SYSTEMS 


The bulk parameters (P, V, T , etc.) of any system in thermal equilibrium are 
independent of time and the enumeration of these parameters for a particular 
system is sufficient to define the macrostate of the system. Thermodynamics is 
concerned only with the relationships between these macroscopic parameters 
and involves no atomic (microscopic) model of the system of interest. The 
microscopic view of thermal equilibrium is necessarily more complicated than 
the macroscopic since the state of macroscopic equilibrium may generally be 
realised by many different atomic arrangements. 

Consider for example a system of three quantised simple harmonic oscillators. 
The oscillators are almost independent of each other (weakly coupled) and have 
the same energy levels as a single oscillator, e = (n +^)e 0 where e 0 = hv 0 , p 0 is 
the frequency and h the Planck constant. When the energy-zero is taken at the 
level-|e 0 the energy levels are equally spaced at intervals of e 0 . A macroscopic 
restraint has now to be placed on the system. As an example let the total 
energy be equal to 2e 0 . On the microscopic scale this energy may be realised 
in the six different ways shown in figure 1.5. The important feature of this 
trivial example, which is generally true, is that the macrostate contains a 
definite number of microstates. 

The basic assumption of statistical mechanics is that in a closed system in 
thermal equilibrium each of these microstates is equally probable. A system of 
three oscillators is, however, too small to show true thermodynamic behaviour. 

In figure 1.5 each of the possible arrangements occurs three times, but as the 
number of oscillators is increased, one particular arrangement becomes far more 
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Figure 1.5 The macrostate is defined by the condition: TV = 2;E = 2. Six different microstates 
satisfy this macroscopic condition since localised identical particles are treated as dis¬ 
tinguishable (chapter 6). 
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probable than any other (section 6.3) until finally only the most probable 
arrangement has to be found in order to calculate the macroscopic properties of 
the system. 


1.4 AN OUTLINE OF THE FOLLOWING CHAPTERS 

Chapters 2 - 5 are concerned with the macroscopic view of the thermal equili¬ 
brium state. The laws of thermodynamics and the concept of a function of 
state are discussed and applied to some important systems. In chapters 6 and 7 
a microscopic view is adopted and the connection between the two viewpoints 
is also established. Chapter 10 contains no new principles but seeks to illustrate 
the power of the results obtained in earlier chapters by applying thermodynamics 
and statistical mechanics to areas of physics of current research interest. Chap¬ 
ters 8 and 9 are devoted to the kinetic theory of the perfect gas and to the 
non-equilibrium (transport) properties of systems. 



PARTI 

THERMODYNAMICS 



2. First Law of Thermodynamics 


Classical thermodynamics is concerned only with systems in thermal equilibrium. 
A thermodynamic approach to transport properties such as thermal conductivity 
and to biological processes is an important branch of current research, but will 
not be considered in this book. 

When a macroscopic system such as a fixed volume of a gas is isolated from 
its surroundings, its bulk parameters become independent of time after some 
characteristic time for the system has elapsed. A system will be said to be in 
thermal equilibrium if its macroscopic state can be completely specified by 
some small number of macroscopic parameters which are all independent of 
time. At the microscopic (atomic) level processes such as collisions of gas mole¬ 
cules continue to lead to changes in the states of individual molecules but 



Figure 2.1 The temperature dependence of the volume of unit mass of water at constant 
pressure. A specification of (P, V 0 ) near 4 °C is not sufficient to define the state of the 
system since the water temperature could be 0 O or Q\. 
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macroscopic quantities will be seen to involve averages over all the molecules, 
and these average values become independent of time once the system has 
reached thermal equilibrium. 

A simple system will be defined as a system which in thermal equilibrium 
can be completely specified by just two independent variables. A volume of gas 
of fixed mass in which no chemical reactions occur is an example of a simple 
system, since the state of the gas is completely specified by the pressure and 
volume. A volume of water near 4 °C is a non-simple system (figure 2.1), since 
three independent parameters (P, V, T) are needed to completely specify the 
equilibrium state. 


2.1 ZEROTH LAW AND SCALE OF TEMPERATURE 

The most directly accessible thermal concept is not heat (which will be discussed 
in section 2.3) but rather temperature, the relative sensations of hot and cold. 
When a hot body (A) is placed in physical contact with a cold one (B) the 
composite body is always found to have a ‘hotness’ intermediate between that 
of A and B. A body which is hot relative to its surroundings is always cooler 
after the parameters of the whole system have become independent of time. 
There is therefore a definite direction in time for thermal processes which is 
absent from purely mechanical interactions, since reseparating the bodies does 
not lead to them returning to their original states of hotness and coldness. 

The argument may be made more concrete by considering two volumes of 
gas of equal mass which are completely isolated from external influences 
(figure 2.2). The walls of such a container are said to be adiabatic walls. The 
gases are each in thermal equilibrium within their own containers and the 
original states of the two gases are completely defined by the parameters 
(Pai , V) and (Pa 2 > JO- If ^Ai is greater than P A2 > then when the gases are 
placed so that the two containers are in thermal contact, say by using a thin 
metal (diathermal) wall then P Al will decrease to P B1 and P A2 will increase to 
P&2 . When the two containers are reseparated within adiabatic walls these new 
states will continue to exist. The thermal process is therefore irreversible. The 
second law of thermodynamics discussed in chapter 3 is the great generalisation 
of the difference between mechanical and thermal processes. The zeroth law, 
which is discussed in this section is not necessary for a completely logical 
development of thermodynamics, but remains a useful introduction to thermal 
concepts. 

All the possible equilibrium states of a simple system (1) may be described by 
some function of two variables f(P t , Fi) since the parameters (Pj, Vi) com¬ 
pletely define the state. Similarly system 2 will have states given by f(P 2 , V 2 ). 
When these two systems, each in thermal equilibrium, are placed in thermal 
contact, however, the parameters of both systems will in general change with 
time until a new state of thermal equilibrium of the whole system has been 
established. In the special case where the parameters do not change when the 
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(c) 


Figure 2.2 (a) The two systems enclosed within adiabatic walls are each in internal thermal 
equilibrium, (b) When thermal contact is allowed however the pressure in each container 
changes until finally the composite system comes to a new equilibrium state, (c) If the 
adiabatic wall is now replaced the individual systems remain in these states. 


systems are placed in thermal contact the systems 1 and 2 are said to be in 
mutual thermal equilibrium. 

First consider system 1 in the state (Pai , Fai) as the reference system. Then 
/(Pai , Fai ) is equal to some value 0 A i an d all the states of system 2 which are 
in thermal equilibrium with system 1 are classified by the equation 

f(P ai,V ai ) = 0 A I =m,V 2 ) (2.1) 

The function /(P 2 , V 2 ) is called an isotherm (figure 2.3). Now using system 2 as 
the reference system all the states of system 1 which satisfy equation 2.1 can be 
classified so that finally 

fVuV x ) = /(P 2 ,F 2 ) = 6 (2.2) 

classifies all possible states of mutual equilibrium of the two systems. 

The separate equilibrium states of the two systems involve four independent 
variables but the mutual equilibrium states involve only three independent 
variables. The restraint imposed by equation 2.2 is called the empirical tempera¬ 
ture and an equation of the form 


/(p, V) = e 


(2.3) 
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Figure 2.3 The locus of all the states of a system which are in mutual thermal equilibrium 
with a reference system is called an isotherm. 0 (defined by equation 2.2) is called the 
empirical temperature. 


is called an equation of state. 

The zeroth law may now be stated: if system 1 is in mutual thermal equili¬ 
brium with system 2 and with system 3, then system 2 is in mutual thermal 
equilibrium with system 3. 

System 1 can therefore be used to classify all states of mutual equilibrium 
between other systems, that is to say all the states of a system which lie on the 
isotherm having the empirical temperature 0, where 0 is defined by equation 2.3. 
System 1 is called a thermometer. 

A scale of temperature may be defined in terms of equation 2.3 if a value of 
0 is decided upon for some easily reproduceable fixed point or points. The 
Celsius scale is defined, for example, if some property of system 1 is measured, 
say the pressure of a gas at constant volume, in melting ice (0 °C) and the boiling 
point of water under a pressure of one atmosphere (100 °C). Then any other 
temperature on the Celsius scale of system 1 is defined by the equation 

e p = f e ~ p ~ X 100 °c (2.4) 

MOO “ r 0 

The values at the measured points are therefore simply joined by a straight line 
(figure 2.4) and extrapolated as necessary. 

The Celsius scale is still in everyday use but clearly has no fundamental signi¬ 
ficance. In particular it should be noticed that there is no restriction on the 
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Figure 2.4 The Celsius scale for a given thermometric substance is established by measuring 
the magnitude of some parameter at the melting point of ice and the boiling point of 
water under a pressure of 1 atmosphere. All other temperatures are found by measuring 
the parameter and either reading off the temperature from the graph or by using an 
equation of the form of equation 2.4. 


range of temperature since equation 2.4 may be extrapolated to plus and minus 
infinity. 

The temperature of a given simple system is uniquely defined by equation 2.4, 
and all other systems in mutual thermal equilibrium with the reference system 
are correctly classified but the number relating to the temperature will depend 
upon the thermometer. Consider two thermometers, say the length of a column 
of liquid such as mercury and the resistance of a metal. Then 

0, = le /o . X 100 °C 
fioo ~ h 

0 R = x 100°C 

^100 _ *0 


by definition. 

Suppose however that the resistance of the wire is measured as a function of 
temperature (using the mercury thermometer) and the resistance is found to 
have the form 


R e = *o(l + bd t + c6 l 2 ) 
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where R 0 ,b,c are constants. Then 


b6 l + c0, 2 
b + 100c 


(2.5) 


and Or is not equal to 0/ (except at the fixed points) unless c is zero (figure 2.5). 
It is therefore not sufficient when temperature is defined by equation 2.4 to say 
that a solid melts at 50 °C, the particular thermometer must also be specified. 



0j(°C) 

Figure 2.5 The relationship between the temperature measured using a mercury thermo¬ 
meter and a resistance wire (equation 2.5). The constant c has been taken to be negative 
as in exercise 2.1. 


The Celsius scale as defined above is therefore unsatisfactory because (a) no 
physical reasoning lies behind the scale, and (b) the number given for the tem¬ 
perature of a body will depend upon the thermometer in use. The ‘fixed points’ 
of the Celsius scale are also unsatisfactory because they cannot be reproduced 
experimentally to the highest accuracy required in modern thermometry 
(« 0.001 °C). 
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The objection to the Celsius scale made in point (a) above can only be fully 
understood after the second law of thermodynamics has been established in 
section 3.1. The objection in point (b) could be met by always using the same 
thermometer. The perfect gas scale of temperature is the chosen scale because, 
as shown in section 3.1, it is identical to the thermodynamic scale of temper¬ 
ature which is independent of the thermometric substance. 

The perfect gas scale is defined using the equation of state 

PV 

T=~ (2.6) 

R 

for 1 mole of a perfect gas, where R is the gas constant. 

The size of the degree is defined by the triple point of water, where ice, 
water and water vapour are in equilibrium (« 0.01 °C) being taken as 273.16 K 
where K means degrees Kelvin (after the inventor of the thermodynamic scale). 
Then any other temperature is defined by 

T = x 273.16 K (2.7) 

(P1^2 73.16 

The thermodynamic scale may be shown to be always positive (once the 
triple point has been defined to be positive) and therefore implies an absolute 
zero of temperature (section 3.1). 

The Celsius scale is now treated as a derived quantity defined by 

6 = (T - 273.15) °C 

The choice of 273.16 K for the triple point was made so that the fixed points 
on the Celsius scale were nearly unaltered from their original values. 

The actual measurement of temperature on the thermodynamic scale is 
extremely difficult. A real gas will not have an equation of state given by 
equation 2.6 but for sufficiently high temperatures an extrapolation to zero 
density of the product (PV) does lead to the gas scale (section 2.2). Then 

T = T - ( P V } . T . - 1 x 273.16 K 

L (^F)273.16_|i’-0 

A direct measurement of temperature on the thermodynamic scale is however t 
rarely attempted outside a standards laboratory such as the National Physical 
Laboratory or the National Bureau of Standards. 

The international scale of temperature is a set of reproduceable fixed points, 
some of whose values on the thermodynamic scale are given in table 2.1. A con¬ 
venient thermometer, the resistance of platinum wire in the temperature range 
14-900 K for example, is then chosen and its properties measured at the fixed 
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TABLE 2.1 


The International Scale of Temperature 


(K) 

Triple point of hydrogen 

13.81 

Boiling point of hydrogen 

20.28 

Boiling point of oxygen 

90.19 

Triple point of water 

273.16 

Boiling point of water 

373.15 

Freezing point of zinc 

692.73 

Freezing point of silver 

1235.08 

Freezing point of gold 

1337.58 


Some of the important reference temperatures on 
the International Scale of Temperature. The pressure 
is understood to be exactly 1 atmosphere for the 
boiling and freezing points. 

points within its range. An interpolation between the fixed points, either by 
curve-fitting using a computer or else by a correction formula similar to equa¬ 
tion 2.5, enables a table of resistance against thermodynamic temperature to be 
constructed. The scale is of course not linear in the resistance since the resistance 
of platinum is never exactly proportional to the thermodynamic temperatures. 

In table 2.2 some values of 



are given for platinum and germanium resistance thermometers. The value of z 
is a rough measure of the sensitivity of the thermometer and would be equal to 
1 if the resistance was directly proportional to the thermodynamic temperature. 
The germanium resistance thermometer can be seen to have low sensitivity at 
high temperature - where the platinum resistance thermometer is obviously 
superior — but improves as the temperature is lowered. The measurement of 
temperatures on the thermodynamic scale at low temperature presents extra 
difficulties, one solution to which is discussed in section 10.6. 


2.2 EQUATION OF STATE 

A simple system has been defined as one whose state is completely specified by 
two independent variables. The equation which contains all equilibrium states of 
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TABLE 2.2 


The performance of two typical resistance thermometers 
as a function of temperature 

Temperature 

Platinum thermometer 

Germanium thermometer 

n k ) 

£(S2) 

T (dR \ 

R v dr/ 

R(S 2) 

T (dR ) 
R\dT J 

1.5 

_ 

_ 

13449 

3.74 

4.2 

0.0113 

1.48 

1099 

1.62 


0.1073 

3.52 

193.36 

3.89 


7.3387 

1.50 

4.868 

0.45 


28.318 

1.07 

- 

— 

770 

72.536 

0.92 

— 

— 


The performance of a platinum resistance thermometer is seen to be inferior to that of a 
typical germanium thermometer at temperatures below 20 K but becomes greatly superior 
to it at high temperature. 


the system is therefore of the form 

T = /(P, V) 


or equivalently 


P = f(V,T) 

K = /(P,r> 

The two parameters need not be pressure and volume, which have been used up 
to now because a gas has been used for illustrative examples. For the stretching 
of a wire at constant volume, for example, they would be tension and length. 
The important feature of a simple system is that T is a function of only two 
variables so that 


T = f(x,y) = T( Xi y) 

The second expression will be used from now on to indicate that T is some 
function of x and y. 

The simplest equation of state is that of the perfect gas 


n 0 RT 


n 0 N 0 kT _ NKT 

~~V V 


(2.9) 


for n 0 moles. 
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This equation is also a good description of a real gas at high temperature and 
low density. The isotherms on a P-V diagram (figure 2.6) are hyperbolae. At 
low temperature and high density however, a real gas will liquefy and one 
section of each isotherm is parallel to the F-axis (figure 2.6). Equation 2.9 is 
derived in section 7.5 for a gas of particles of negligible total volume whose only 
interactions are elastic collisions. The energy of the particles is therefore purely 
kinetic energy. At low temperature and high density these restrictions are no 
longer valid for a real gas and a more complicated equation of state is required. 



Figure 2.6 The isotherms of a real fluid. At high temperature and low density (large volume) 
the perfect gas equation (2.9) is satisfactory but at temperatures below T c the gas lique¬ 
fies. The perfect gas equation is therefore unsatisfactory except for T> T c and low 
density. 


The virial expansion is one such equation of state and may be written either 

PV = A ^1 + £? + ...^ (2.10) 

or 

PV = A + B P P + C P P 2 + ... (2.11) 

The series may be understood as the corrections to the perfect gas law due to 
pair interactions between molecules ( B ), three-body interactions (C) and so on. 
The coefficients A , B and C are functions of temperature. The coefficient A is 
simply equal to RT as shown in exercise 2.2. The virial expansion is most useful 
for relatively small departures from the ideal gas law ( C v V~ 2 <B v V~ x < 1) 


First Law of Thermodynamics 


27 


where simply B v is often a sufficient correction. The series does not converge 
as the temperature approaches the critical temperature (T c in figure 2.6). 

The virial coefficients can be measured directly for real gases and also cal¬ 
culated for certain models of intermolecular interactions. A more ambitious 
approach is to attempt to find an equation containing parameters independent 
of the temperature which would reduce to the perfect gas law in the limit of 
high temperature and low density, but would also fit the other isotherms in 
figure 2.6. The first attempt at this type of equation of state was due to 
van der Waals, who introduced two constants, V to correct for intermolecular 
attraction, and V to correct for the finite size of the molecules 



for 1 mole. 

The isotherms of this equation are shown in figure 2.7. The equation is cubic 
in V and therefore has one or three real roots. The isotherms are clearly not 
identical to figure 2.6. The states along AB and CD are metastable and may be 
reached experimentally. The portion BC is, however, mechanically unstable 
since ( dP/dV) T is positive. (A small decrease in the pressure in region BC would 
lead to a decrease in the volume, which is clearly unstable.) Maxwell has shown 
that the best van der Waals isotherm is that for which the area ABE is equal to 
ECD (see figure 2.7). 



Figure 2.7 Isotherms of a van der Waals gas. The states along AB, CD are measurable. BC is 
unstable. The dotted line shows the Maxwell construction. 


Many other two-parameter functions of state have been introduced since the 
van der Waals equation but none of them are able to reproduce all the features 
of a real fluid. The van der Waals equation therefore remains a useful approxima¬ 
tion and will be used throughout the book to distinguish between the properties 
of real and perfect gases. 
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The flat section of the isotherms of a real fluid shown in figure 2.6 can be 
seen to become smaller as the temperature rises. At the critical temperature T c 
it becomes a point of inflexion (P c , V c ) and therefore 


d 2 P \ = /3P\ 

dV 2 ) T [dvJ T 


(2.13) 


at the critical point. Using equation 2.12 



3b T c 


8 a 
21Rb 


RT C = 8 
We 3 




&T' 


P' 



T' 


T 

T~c 


(2.14) 


(2.15) 


(2.16) 


The van der Waals equation therefore predicts a value of 8/3 for RTJP C V C for 
all gases rather than the perfect gas value of unity. The experimental values are 
however always greater than three and also depend upon the gas. For example, 
RTJP C V c for hydrogen is 3.06 and for helium is 3.08 but for nitrogen and 
oxygen it is 3.42. The reduced equation of state given in equation 2.16 is a 
universal curve which all gases would obey if the van der Waals equation was 
correct. In fact any two-parameter equation of state may be brought into 
reduced form. 

The three equations of state introduced in this section will be used through¬ 
out the book to observe the difference in the behaviour of real and perfect gases. 
The reader must remember however, that close to the critical point all three 
equations are inadequate. The gas - liquid transition cannot be followed by 
equations of this form. A more subtle point to note is that a real gas with an 
equation of state other than PV = RT may, under certain special conditions, 
behave like a perfect gas. In exercise 2.2 it is shown that Bp = By = b-a/RT. 
There is therefore a temperature (the Boyle temperature T B ) equal to a/Rb 
where B is zero. The equation of state given by equation 2.10 is equivalent to 
the perfect gas equation PV = RT over a wide range of pressure at the Boyle 
temperature. However if the property being studied involves not B but, say, 
dB/dT, the real gas will not behave like a perfect gas at the Boyle temperature. 
In the throttling process discussed in section 5.3, for example, the important 
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parameter is [T(dB/dT) — B] and the real gas behaves like a perfect gas not at 
7g but at 27g. 


2.3 FIRST LAW OF THERMODYNAMICS 


In mechanics, the potential energy of a mass m at a point z where the accelera¬ 
tion due to gravity is g , is defined to be 

U(z) = mgz 

Since the reference plane has been chosen arbitrarily, U(z ) is defined only to 
within a constant. The difference of potential energy between two points is, 
however, completely defined since 

U(z B ) - U(z A ) = mg(z B - z A ) (2.17) 

In a system without dissipative forces such as friction (a conservative system) 
the increase in the potential energy between z B and z A is simply equal to the 
work done on the mass m 


J Z B 
Z A 


mgdz = mg{z B - z A ) - U(z B ) - U( z a) 

(2.18) 


The change in the energy of the system in going from z B to z A depends only on 
the final coordinates and not on the path by which the coordinates were reached 
(figure 2.8). 

The potential energy is just one of many forms of energy. The law of con¬ 
servation of energy states that energy can neither be created nor destroyed. In a 
closed system without dissipative forces, the total energy may be changed, say, 
from potential energy to kinetic energy and back to potential energy, provided 
that at all times the sum of potential and kinetic energies remains constant. 

\mv B 2 + mgz B = jmv A 2 + mgz A 


It sometimes becomes necessary to extend the definition of energy to ensure that 
the energy of a closed system is conserved. The change from Newtonian 
mechanics to relativistic mechanics for example, involved the recognition that 
the rest mass of a particle m 0 contains energy m 0 c 2 where c is the velocity of 
light. A decrease of mass in a closed system therefore involves the release of 
energy (as occurs in a nuclear reaction). When all sources of energy have been 
taken into account however, the law of conservation of energy appears to be 
completely rigorous. 

There are three features of equation 2.18 which should be noted: (a) the 
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Figure 2.8 The difference in the potential energy of a particle at two points is independent 
of the path joining the points. In a conservative system the work done on the body to 
take it from A to B is equal to the work done by the body if it returns from B to A. 


equation may be read in either direction. If the particle is at z A then work (W) 
may be done on the mass to move it to z B . Alternatively if the particle is at z B 
it is available to do work W as it returns to z A , (b) the temperature of the body 
does not appear in the equation. Mechanics is concerned only with the external 
aspect of the body, (c) the change in the potential energy depends only upon 
the initial and final coordinates, not upon the path by which the coordinates 
were reached. In particular if the particle is at z A and completes a cycle back to 
z A the change in the potential energy is zero. This is written 


A U = fmgdz = 0 (2.19) 

The function U(z) is called a function of state since it is defined to within a 
constant by the coordinates (the state) of the system. 

The first law of thermodynamics is an extension of the concept of conserva¬ 
tion of energy to include thermal processes. An internal energy function U, 
which for a simple system will be a function of two independent variables, is 
postulated. U(T , V) for example will be defined only to within an arbitrary 
constant but U(T B , K B ) — U(T A , V A ) is, like the change of potential energy in 
equation 2.17, completely defined. We will simplify the notation to U(B ) where 
it is to be understood that B defines the state of the system. 

The question remains of how to establish the properties of the internal 
energy function and indeed to confirm its existence by experiment. The clue is 
provided by equation 2.18. A system in thermal equilibrium in state A is con¬ 
fined within rigid adiabatic walls. The system is therefore isolated from all out¬ 
side influences but it is still possible to perform work on the system (figure 2.9). 
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(a) (b) 


figure 2.9 (a) A system in thermal equilibrium within rigid adiabatic walls. The system is 
isolated from outside influences, (b) The state of the system may be changed by doing 
work on the system by either mechanical or other means. 


The experiments of Joule showed that regardless of whether the work performed 
under adiabatic conditions was mechanical or electrical, a quantity of work W 
always led to the same final state B of the system. 

Wad = TO - U{ A) (2.20) 

The internal energy function of any system is therefore to be found by per¬ 
forming work on the system under adiabatic conditions. (The meaning of this 
equation will become clearer in equation 2.51.) 

When the system is in state A but not surrounded by adiabatic walls, the 
work W required to take the system to state B is found to depend upon the path 
by which the process takes place. The difference between W a d and W is called 
the quantity of heat, or simply the heat, added to the system. 

U{ B) - U( A) = W + Q (2.21) 

The sign convention here is that W is positive if work is done on the system and 
Q is positive if heat is added to the system. In older textbooks the convention 
W is positive if work is done by the system, is used and equation 2.21 would 
then contain — W. The first law of thermodynamics is therefore a law of the 
conservation of energy which includes quantity of heat as a form of energy 
transfer. 
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2.4 THE REVERSIBLE QUASI-STATIC PROCESS 

The function U is to be considered as a continuous differentiable single-valued 
function of the temperature and of one other coordinate in a simple system. The 
differential d£7 is therefore well defined. It is a perfect differential of two vari¬ 
ables for a simple system say (T, V ) and 


d U 


dU\ ^ 
— dT + 
dTjv 


bU\ 

— | dV 

dV J T 


( 2 . 22 ) 


The differential of a function of two or more variables is discussed in appendix 
I. (7(B) — (7(A) is independent of the path between states A and B and may be 
obtained by integrating equation 2.22. 

When Q is zero (the adiabatic process), equation 2.20 is regained and W is a 
state function. Similarly if W is zero, Q is a state function. In general however, 
the magnitude of W will depend upon the path, as will be shown in section 2.5, 
and since (7(B) — (7(A) is independent of the path, Q also changes to compensate 
for changes in W. There are no unique functions of state W and Q and therefore 
no perfect differentials dW and d Q (see appendix I). 

When the infinitesimal change d(7is carried out so slowly that the process 
takes a long time compared to the longest relaxation time of the system, the 
change is said to be a quasi-static process. If reversing the small change in the 
parameters of the system also takes the entire system back to its original state, 
the process is a reversible quasi-static process. The process must take place not 
only sufficiently slowly but also without friction. Under these special conditions 
the infinitesimal work term is well defined and may be written d“IT R . The sym¬ 
bol d" rather than d is used to emphasise that there is no unique function of state 
W. Similarly the heat change may be written cT Q R . The suffix R restricts the 
changes to reversible processes. Then 


dU = cf<2 R + cW R 


(2.23) 


is the statement of the first law of thermodynamics for an infinitesimal reversible 
change of state. The right-hand side of the equation may be integrated from 
state A to state B if the reversible path between the two states is specified. 
dQ R and cf are called imperfect differentials. 


2.5 WORK 

Consider the reversible quasi-static change shown in figure 2.10. The gas in the 
cylinder expands slowly against a pressure equal (to within an infinitesimal 
variation) to its own. Then the work done on the gas is given by 


cTWr = - Fdx = - PA dx = - PdV 
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Figure 2.10 A reversible quasi-static expansion of a gas in a cylinder of cross-sectional area 
A. The work is the area between the path and the F-axis. 


The total work done in a reversible change from state A to state B is 

W=-j*PdV (2.24) 

The work done is therefore simply the area between the path from A to B and 
the F-axis (figure 2.10). The work done depends upon the path. In a complete 
cycle the change in the internal energy is zero, but the work done 


W = - fPdV 


(2.25) 


may be positive, negative or zero (figure 2.11). Notice that 
$ A 'PdV= -$ A ,PdV 

where the arrow indicates that the cycle is performed clockwise or anticlockwise 
from A along the same reversible path in each case. 

The calculation of W using equation 2.24 therefore requires the equation of 
state of the system to relate P to V, and also the path by which the system 
moves from A to B. Two simple examples will now be given. If the system is a 
perfect gas, then PV = RT and 
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Figure 2.11 In a cycle the change in the internal energy is zero but the total work is equal to 
the area enclosed by the cycle, (a) Net work done by the gas, (b) net work done on the 
gas, (c) net work zero. 


In a reversible quasi-static change from V A to V B at constant temperature T 0 
(an isothermal change) the integral becomes 

w -- RT -fX 7 (y A ) W 

An even simpler example is a change at constant pressure P 0 . In this case the 
equation of state is not required since 

w = - P 0 rp 5 d v = - P 0 ( V B - V A ) (2.27) 

J V A 

The work term is not necessarily of the form PdV, but in a simple system 
with only two independent parameters can always be written in the form X dx. 
A wire extended dl under a tension at constant volume has a work term 
& d/,for example. This term is positive because / increases with ^ unlike V 
which decreases as P increases. The work term for a system in a magnetic field 
is discussed in section 10.6. 


2.6 SPECIFIC HEAT OR HEAT CAPACITY 

When a quantity of heat d ~Q B is added to a system the temperature of the body 
usually increases. The heat capacity of the body is defined as 


C x = 


(&Qr\ 

V dr ) x 


(2.28) 
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Here x refers to the constraint under which Cis to be measured. Since Q is not 
a function of state the heat capacity is meaningless unless the constraint is 
specified. The most important quantities are the heat capacity at constant 
pressure 


C P 



(2.29) 


because this is the quantity usually measured for a solid, and the heat capacity 
at constant volume 



which is the quantity calculated by statistical mechanics. The heat capacity under 
conditions of saturated vapour pressure is also important for liquids. 

The specific heat is the heat capacity of unit mass of a substance. The unit is 
usually taken to be the mole in thermodynamics and the molecule in statistical 
mechanics. The relation between the two is then simply the Avagadro constant. 

When a system changes its phase (solid - liquid, liquid - vapour, solid - vapour) 
the heat absorbed per unit mass by, say, the solid at the melting point is called 
the latent heat. The specific heat at the phase transition is infinite, since the 
system absorbs a quantity of heat without change of temperature. 

The specific heat of a substance is in general a function of both temperature 
and volume and as the most experimentally accessible thermal quantity is of 
great theoretical and experimental importance. The specific heat of solids is dis¬ 
cussed in section 10.3 and of gases in section 7.5. 

A relationship between C P and Cy of any system may be found using only 
the first law of thermodynamics, but it will be possible to simplify the general 
result after the second law has been established. The first law for an infinitesimal, 
reversible quasi-static change of a simple system may be written using equations 
2.22-2.24. 


Therefore 


aQ *-{w\ iTt ['*(iH] dK <2J1) 

imJ\ 

l iT ), " Ur| " Cr < 2 - 32 > 


giving 


<re R =c,dr t [r.(^)Jdr 


( 2 . 33 ) 
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The left-hand side of the last equation is equal to C P so 

Cr ~ CvS <234) 

is the general result. The term P(dVjdT)p may be evaluated from the equation 
of state of a particular system, but further information is required to find 
(dU/dV) T . This term can, in fact, be eliminated after the second law of thermo¬ 
dynamics has been established in section 3.1. The internal energy of a perfect gas, 
however, is independent of its volume (Joule’s law) so (3 U/bV) T is zero. Then 



for one mole of a perfect gas and 

C P - C v = R = N 0 k (2.35) 

Notice that the specific heats of a perfect gas with two or more atoms per 
molecule may be a function of temperature. At any given temperature however 
the difference in the molar specific heats is equal to R . The difference in the 
specific heats per molecule of a perfect gas may be written 

cp — cy = k (2.36) 

where k is the Boltzmann constant. 


2.7 HEAT ENGINES 

Thermodynamics began as the study of the efficiency of heat engines. An 
engine will be defined as a machine which at the end of a complete cycle has 
converted some heat into useful work. The process can then be repeated for 
another cycle and so on. Since A U is zero for a complete cycle the first law 
(equation 2.21) becomes simply 


W = - Q 


(2.37) 
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Figure 2.12 An engine absorbs heat from one thermal reservoir, does work, rejects heat to a 
second thermal reservoir at a lower temperature and then returns to its original state. The 
whole operation is called a cycle. 


The simplest possible engine is represented in figure 2.12. The engine extracts 
heat Qa from the hot body, performs work W , rejects heat Q B to the cold body 
and returns to its original state. When the thermal capacity of the bodies is so 
large that the change of Q A or Q B does not effectively change their temperature 
they are called thermal reservoirs. The work and heat processes in the cycle are 
related by equation 2.37 

W = Qb ~ Qa 

(remembering that work done by the system is negative). The efficiency of an 
engine ( 17 ) is defined to be 


Work done 

17 =- 

Heat extracted from the hot body 

= _ JV = Qa - 6b 
Qa Qa 

The statement that figure 2.11 is a representation of the simplest possible 
engine cannot be maintained on the basis of the first law of thermodynamics 
alone. The first law may be stated, on the basis of equations 2.37 and 2.39. 
The efficiency of an engine working in a cycle cannot exceed unity. 


(2.38) 

(2.39) 


r? < 1 (First Law) 


(2.40) 
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The efficiency is unity if no heat is rejected to the cold reservoir ( Q B = 0). The 
second law, to be considered in chapter 3, imposes the more rigorous condition 

T? < 1 (Second Law) (2.41) 

that is to say a source and a sink of heat are required for an engine since Q B =£ 0. 
The efficiency of real heat engines is always far less than unity of course, but in 
chapter 3 the maximum theoretical efficiency of a heat engine will also be 
shown to be in agreement with equation 2.41. 

There is a limit therefore to the efficiency with which heat can be converted 
into work by a cyclic process. Heat can however be completely converted to 
work in any non-cyclic process for which there is no change in the internal 
energy of the system. A perfect gas, for example, has 

The internal energy of a perfect gas is therefore a function of temperature U(T ) 
only. In a reversible isothermal expansion of a perfect gas AU = 0 and 

Q = - §&W = RT 0 ln ( 2 42 ) 

from equation 2.26. Heat Q has been completely converted to work but the 
system is now in the state B, having extracted heat Q from its surroundings to 
keep the temperature of the gas constant (figure 2.13). 



Figure 2.13 An isothermal expansion of an ideal gas from A to B converts all the heat 
absorbed from the thermal reservoir to work since the internal energy of a perfect gas 
depends only upon the temperature, but the process is not cyclic. 
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2.8 CONCLUSIONS BASED ON THE FIRST LAW 

The first law of thermodynamics has been established to be in general 

£/(B) - U( A) = Q + W 

and for a reversible infinitesimal change 

GU = tf<2 R - PdV 
J<2r = d U + PdV 


(2.43) 


(2.44) 


U is a function of state and for a simple system is a function of two independent 
variables. Considering U as a function of temperature and volume [U(T, V) ] 


CffiR - Cy d T + 


a-Ha} 

HSl- 


(2.45) 


cfg R is well defined but depends upon the path by which the changes in 
temperature and volume occur. 

The first law in the form of equation 2.44 for a simple system contains only 
one work term (— P dV). The sum of all the (n — 1) reversible work processes 
occurring in a general system may be written 


6W r - 2 x t**r 
r =1 

where X r is a generalised force (such as — P or and dx r is a generalised 
infinitesimal displacement ( dV, d/). The internal energy now becomes a function 
of n variables, say the temperature plus the (n - l)x r and 


The first law now becomes 


[(iD/*]* (246) 


^ Y r d y r 


(2.47) 
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Here (dUjdT) x means that all the (n — l)x r are to be held constant and (3 U/dx r ) c 
means that the temperature and all x other than x r are to be held constant while 
performing the differentiation. The final form of equation 2.47 will be found 
useful in chapter 3, the significance of the Y r and dj v r should be clear from 
equation 2.46. 

The parameters which occur in equation 2.44 or 2.46 may be divided into 
two types. The parameters which are independent of the size of the system 
(i\ 3^, T ) are called intensive parameters; those proportional to the size of the 
system ( U , Q , C v , V, l ...) are called extensive parameters. The internal energy, 
for example, could be written U=uV where u is the internal energy per unit 
volume of a uniform system. This division is only correct for large systems 
where surface effects are unimportant relative to the bulk contribution to the 
thermodynamic quantities. 

In establishing equation 2.43 in section 2.3 we began with the purely 
mechanical equation for the potential energy of a body 

W = U(z B ) - U(z A ) (2.48) 

and following Joule’s experiments on the conversion of work into heat we 
wrote, by analogy 


H'a d = U(B) - U(A) (2.49) 

for a system within adiabatic walls. 

Although these two equations appear to be identical there is in fact a most 
important distinction to be made between them. In section 2.3 we saw that in 
mechanics z B and za were on an equal footing (equation 2.18), either work 
was done on the particle to raise it from z A to z B or else work was done by the 
particle in going from z B to z A . The outstanding result of Joule’s experiments 
for a system within rigid adiabatic walls was that while any pair of states (A, B) 
were found to be accessible by a suitable adiabatic work process, if B could be 
reached from A then A could not be reached from B. 

The difference between equations 2.48 and 2.49 is therefore that W can be 
either positive or negative in equation 2.48 but only positive in equation 2.49. 
Adiabatic work can always be done on the system — say by the paddle in figure 
2.9, to increase its internal energy — but a system within rigid walls cannot, 
under adiabatic conditions, return to its original state while conserving energy 
by performing work on the paddle. The law of conservation of energy is not in 
itself sufficient to describe thermal processes. A second law of thermodynamics 
is therefore required, as will be seen in chapter 3. 

When the system is enclosed within adiabatic but non-rigid walls equation 
2.49 is still applicable, but the relationship between pairs of states is rather more 
complicated, since the coordinates of the systenvcan be varied even when the 
paddle is not used to perform work on the system. States A and B! may then 
exist which are mutually accessible along the reversible adiabatic curve shown 
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Figure 2.14 A system in thermal equilibrium within non-rigid adiabatic walls. The system 
may be able to move from A to Bj and from Bi to A as the pressure is varied (reversibly) 
when no electrical work is performed. All the points in the shaded region can be 
reached from A by a combination of electrical work and change of volume but A cannot 
be reached from B 2 . 


schematically in figure 2.14. The point B 2 is also accessible from A as the result 
of a combination of external work and reversible adiabatic changes. In fact the 
whole of the shaded region in the figure is accessible from A, but A is not 
accessible from states within the shaded region. 

The general relationship between the states of a system within adiabatic 
walls is therefore that if A is not accessible from B then B is accessible from A 
but certain pairs of states A and B may be mutually accessible. 

In the constant volume process under adiabatic conditions discussed in 
section 2.3, Q = 0 and 

W'ad = £/(B) - U{ A) = J® cy dr (2.50) 


When the thermal capacity is independent of temperature this becomes simply 

Wad = C v (T b - 7a) (2.51) 

Joules’ experimental conclusion - that if for a system at constant volume the 
state B could be reached from A by performing adiabatic work then the state A 
could not be reached from B by performing adiabatic work—is now seen to be 
equivalent to saying that the temperature of a system within rigid adiabatic walls 
can be increased by performing work on it but this energy is not recoverable as 


42 


Thermal Physics 


external work. The system does not spontaneously cool back to T A while per¬ 
forming work on the paddle in figure 2.9. This irreversibility of thermal and 
mechanical processes is the fundamental problem, going beyond the law of 
conservation of energy, which will be discussed in chapter 3. 


EXERCISES 

2.1 The resistance of a wire is Re at a temperature 0 °C measured on the ideal 
gas scale and 


R e = jR 0 0 + oB + 00 2 ) 

where R 0 is the resistance at 0 °C, a = 3.5 x 10“ 3 (°C) _1 ,0 = — 3.0 x 10~ 6 (°C)“ 
Calculate the temperature on the resistance scale when an ideal gas thermometer 
reads 70 °C. 

2.2 The virial equation may be written as in equations 2.10 or 2.11. Show that 
(a) A=RT, (b )B P =B V , (c) C v =B P 2 + AC P , (d )B = b- a/RT, (e) C v = b 2 
where a and b are the van der Waals constants. 

2.3 Derive equations 2.14-2.16. 

2.4 Find expressions for the work done by a gas in expanding from volume V\ 
to V 2 (a) at constant pressure, and (b) at constant temperature (isothermal 
expansion) for the following equations of state 

(i) PV = RT 

(ii) P(V-b ) = RT 

(iii) (P + ~)V = RT 

( iv ) (P + -p (V-b) = RT 

Distinguish between external work and work to overcome intermolecular forces. 

2.5 Find the change in the internal energy of one mole of a monatomic perfect 
gas in an expansion from 5 m 3 to 10 m 3 at a constant pressure of 1 atmosphere 
(10 s Nm -2 ) if 


Cp/Cy — 7 “ 5/3. 
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2.6 Two moles of a monatomic perfect gas are initially at a temperature of 600 K 
and a pressure of 2 atmospheres. The gas expands reversibly to twice its original 
volume. Calculate (a) the work done by the gas, (b) the heat supplied to the gas 
and (c) the change in the internal energy of the gas. 

2.7 A perfect gas is taken through the following cycle: (ab) isothermal expansion 
at temperature T x from volume V x to volume 2V X , (be) compression at constant 
pressure P x to volume V t , (ca) change of pressure at constant volume. Find 
expressions for the work done, the heat transferred and the change in the 
internal energy of the gas in each part of the cycle and show that the sum of the 
three terms is zero for the internal energy only. 

2.8 One mole of water at its boiling point is converted into vapour under a con¬ 
stant pressure of 1 atmosphere. The heat absorbed (latent heat) was found to be 
41 kJ. Calculate the external work done, assuming that the vapour behaves as a 
perfect gas. Explain the difference between the heat supplied and the external 
work done. 

2.9 Find an expression for the work done when a wire is heated from tempera¬ 
ture T x to temperature T 2 at constant tension ^ 0 . 

2.10 Show that W is not in general a function of state by considering the con¬ 
sequences of a function W(T , V ). 



3. Second Law of Thermodynamics 


The first law of thermodynamics expressed in the form 

U( B) - U( A) = Q + W (3.1) 

has been seen to be a form of the law of conservation of energy in which heat is 
to be treated as a form of energy or, more exactly, as a form of energy transfer 
between systems. The first law is sometimes stated in terms of the impossibility 
of building a certain kind of perpetual motion machine. The machine must work 
in a cycle (section 2.7) and A U is zero at the end of each cycle. The first law 
then requires W = — Q and the efficiency r\ < 1 (equation 2.40). The efficiency 
with which heat can be converted into work in a cyclic process cannot, accord¬ 
ing to the first law of thermodynamics, exceed unity. The fact that real heat 
engines always have r? < 1 was one of the origins of the second law of thermo¬ 
dynamics. 

A thermal process which violates equation 3.1 cannot occur, but it has already 
been remarked for the special case of adiabatic processes, that also many changes 
which do satisfy this equation do not occur. As a further example consider two 
bodies at slightly different temperatures 7\ T 2 inside adiabatic containers 
which are allowed to exchange energy and are then isolated again. The total 
internal energy of the two bodies is unchanged and no work is performed so 


Qi + Qi = 0 (3.2) 

The quantity of heat lost by one body is equal to that gained by the other. The 
first law of thermodynamics does not go beyond this statement. The basis of a 
calorimeter experiment is that the heat lost by the hot body (1) is equal to the 
heat gained by the cold body (2). Heat flows from hot to cold or, equivalently, 
from high to low temperature. The first law fails to select the single direction of 
heat flow between the two bodies. 

A further weakness of the first law is that even in the special form for an 
infinitesimal reversible change it still contains the imperfect differential &Qr. 
The first law of thermodynamics is not therefore very useful for performing 
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calculations on the relationships between the measurable parameters of a system. 
The expression for (C P — Cy) (equation 2.34), for example, contains the term 
(3 U/dV) T which would be difficult to evaluate by experiment for any system 
except a gas. 

The problem of the imperfect differential cT2 R may be stated mathematically 
as follows: show that the first law of thermodynamics in the form of equation 
2.46 or 2.47 has an integrating factor (0 _1 ) such that ctg R /0 is a perfect 
differential. That is to say the differential of some function of state S defined by 

<t>dS = &Q R (3.3) 

where 0 is some function of the coordinates of the system. Since an integrating 
factor does not exist in general (appendix I) for an equation of the form 2.47 
(if n is greater than 2) the postulate that 0 exists must be based on some general¬ 
isation of physical observation, it is not simply a piece of mathematics. 

The history of the discovery of the laws of thermodynamics is briefly dis¬ 
cussed in section 3.8. The second law and the function of state S were originally 
discovered from a consideration of heat engines (section 3.7). The formulation 
of the second law in terms of an integrating factor for cLQr is, however, a more 
direct approach from which the theorems for heat engines can quickly be 
deduced as a special case. The establishment of a function of state S is sufficient 
to simplify the calculation of quantities such as (C P - Cy) (equation 3.14), but 
in order that heat flow and other irreversible processes may be predicted 
correctly, a rule must also be established for the change of S in such a process. 

The first law of thermodynamics essentially involves the conservation of energy 
(A U= 0 for a closed system) but the second law will be seen to be the one-sided 
statement that AS > 0 for a change in a system within adiabatic walls. The 
direction of heat transfer between bodies therefore involves the selection from 
the two states which satisfy the first law (Qi + Q 2 = 0) of the one state for 
which £ is found not to decrease. 


3.1 AN INTEGRATING FACTOR FOR d"Q R 

The first law has been written in equation 2.47 in the form 


cT(2r = X Y r tyr (3-4) 

r= 1 


In a reversible adiabatic change (cf Q R = 0) 
2 Yrdy r = 0 

r -1 


(3.5) 
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where Y r and d y r are defined by equation 2.46. Now an integrating factor does 
not in general exist for an equation of the form of 3.4 if n > 2 (appendix I). The 
assertion that such an integrating factor exists for the first law is therefore a 
statement about Physics (a physical law) not simply a piece of mathematics. The 
correct statement of this problem was first given by Caratheodory and it is useful 
to divide the statement into its mathematical and physical components. 

(1) The mathematical theorem of Caratheodory. An equation of the form 
3.4 is integrable if the neighbourhood of any arbitrary point A contains points 
B inaccessible from A along solution curves of the equation 3.5. 

(2) The physical statement - second law of thermodynamics. In the neigh¬ 
bourhood of any state A of a system within adiabatic walls there are states B 
inaccessible from A. 

The second statement is a generalisation to points mathematically close to a 
given point A of the experimental observation of Joule (discussed in sections 2.3 
and 2.8) that for a system within adiabatic walls 

t/(B) - U(A) = W ad 

but that if state B could be reached from A by performing work on the system, 
then for a system within rigid adiabatic walls it was not possible to return from 
B to A. When the adiabatic walls were non-rigid, states were still found (figure 
2.14) from which the state A was inaccessible. 

The immediate consequence of these two propositions is that an integrating 
factor exists for d"QR. The second law however involves more than this because 
it is a general statement applicable to both reversible and irreversible changes. 

The application to irreversible processes will be discussed in section 3.4 since it 
is first necessary to establish the function S defined by the equation 

dG R =0dS (3.6) 

where 0 and S are some functions of the parameters of the system. It should 
immediately be obvious from the form of equation 3.6 that since 0 and dS occur 
as a product there is no unique function 0 or S. The quantity Q , however, is 
extensive — proportional to the volume of the system — (see section 2.8) and 
therefore 0 will be chosen to be an intensive quantity so that S is also extensive. 
The extensive function S will be called the entropy. The intensive integrating 
factor 0 then has the remarkable property that it is a function of temperature 
alone (appendix I). 

0 = F(0) (3.7) 

where F(0) is called the absolute temperature function since F(6) is the inte¬ 
grating factor for any system at temperature 6. 

The integrating factor for any system at temperature T, where T is defined on 
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the perfect gas scale, will now be shown to be simply T, that is to say the thermo¬ 
dynamic and perfect gas scales are identical. 

The first law of thermodynamics for a perfect gas is written as in equation 
2.31 


The equation of state of the perfect gas is PV-RT for one mole and the internal 
energy is a function only of the temperature. 

Consider 0 as some function of temperature on the perfect gas scale ( T ) such 
that 


r 1 =/co 


Then the second law of thermodynamics may be written 


dS = f(T) 


d T + f(T)PdV 


S is a function of state of, in this case, only two variables so 



(3.8) 


(appendix I). The left-hand side of equation 3.8 is zero since both f(T) and 
(i dU/dT) v are independent of V. The right-hand side is 



d T 


T 


0 


Therefore 

df(T) = _ d T 
f(T ) T 

Therefore 


In f(T) = - In T + constant 


T = 


AT) 


C<t> 


(3.9) 
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The thermodynamic and perfect gas scales are therefore linearly related and if 
both scales are defined to be 273.16 K at the triple point of water, they will 
agree at all other temperatures (section 2.1). An expression equivalent to 

cT{2r = TdS 

was first derived by Clausius using the Camot heat-engine cycle (section 3.7). 


3.2 ENTROPY AS A FUNCTION OF STATE 

The second law of thermodynamics can now be written using equations 3.4, 
3.6 and 3.9 


n 

TdS = £ 

r -1 


Y r dy r 


(3.10) 


in general and for a simple system with only two independent variables (from 
equation 2.33) 


TdS = C v dT + 



dV 


(3.11) 


The equations 3.10 and 3.11 are correct for either a reversible or an irreversible 
change between equilibrium states A and B, since if S is a function of state, 

5(B) — 5(A) is independent of the process by which the system arrived at B from 
A. Only for a reversible change however does d5 = cT Qn/T . Reversible changes 
are considered in section 3.3 and irreversible changes in section 3.4. 

The fact that 5 is a function of state may be used to simplify equation 3.11. 
The equation may be written 


Then 


Therefore 


d5 = 





dT + 


©If©],' 

i = ±r 

T dVdT T L \9 T/v 


['•©I- 

KM"©] 


d 2 u 1 

+ -- 

dTbV 



(3.12) 

v 
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or 



since 

b 2 U = 9 2 U 

bVbT bTbV 


(3.13) 


The important result of this calculation which depends only upon S being some 
(unknown) function of state is that an equation (3.13) has been established from 
which (dUldV) T may be calculated for any simple system if the equation of 
state is known. 

The equation 



- P = T 2 


9 (P 


bT \TJv 


(3.14) 


is called the energy equation. The second form of equation 3.14 shows that 
(i dU/dV) T is zero if the equation of state has the form ,P/T is constant at con¬ 
stant volume, or in general 

P = f(V)T 

This is of course the form of the perfect gas equation PV = RT which, substituted 
into equation 3.14, does indeed lead to ( dU/dV) T = 0 for a perfect gas. 

The second law of thermodynamics for a simple system (equation 3.11) can 
therefore be written 

TdS = C v dT + T[—) (3.15) 

\bT Jv 

or equivalently, since S may be considered as a function of T and V 


TdS = 





dV 


(3.16) 


A comparison of equation 3.15 with 3.16 establishes the important results 



(3.17) 


(3.18) 
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The entropy change of a system at constant volume is from equation 3.15 


S{T*,V)-S(T k ,V) = J 


r B c v 

Ta Y 



(3.19) 


the area under a graph of C v against In T (remember that in general Cy is a 
function of both temperature and volume). 

The condition for C v to be independent of V may be established from 
equation 3.17 



on changing the order of differentiation. Then using equation 3.18 



(3.20) 


The condition for the heat capacity of any system to be independent of 
volume is therefore that ( d 2 P/dT 2 ) v be zero, a condition satisfied by both a 
perfect gas and a van der Waals gas. 

The four important results given in equations 3.14,3.17,3.18 and 3.20 have 
been established simply by using the fact that 5 is a function of state and the 
general mathematical relations between partial differentials. The nature of 5 
(the ‘meaning’ of entropy) is not involved. The results coifld not be obtained 
from the first law of thermodynamics which contains the imperfect differential 

cTCr. 

Since any function of functions of state is also itself a function of state there 
are many relations of the form 3.18 (known as Maxwell relations) some of which 
are derived in section 4.2. Their importance of course is that quantities such as 
(35/9 V)t can always be removed from equations in which they occur and 
replaced by differentials which relate to quantities that are more accessible to 
experiment. 

The function ( dP/dT)y for example may be measured easily for a gas but 
would be difficult to measure for a solid or liquid. However using the relation 



(3.21) 
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(appendix I), and the definitions of the isothermal compressibility (k t ) and 
coefficient of volume expansion (a) 



It should be remembered that a and k t are in general functions of temperature 
and volume. 

These two simple calculations illustrate one of the most important features 
of thermodynamics. The number of independent thermodynamic quantities for 
a system is limited by the form of the second law due to the relationships of the 
type shown in equations 3.12 or 3.24 which must exist between the different 
terms if S is to be a function of state. The manipulation of the mathematical 
properties of the differentials of a function of n variables (n = 2 for a simple 
system) therefore leads to relationships for quantities such as ( C P - Cy) or 
(bS/dV) T in terms of other quantities which are possibly easier to measure 
experimentally. 

Thermodynamics cannot be used to establish the magnitude of any single 
physical quantity although it can, in certain special cases, show that they become 
zero or infinite. Cy and a for example tend to zero as the temperature approaches 
absolute zero (section 4.4) and since (dP/dV) T is zero at the critical point 
(section 2.2) the isothermal compressibility goes to infinity as the temperature 
approaches T c (see also section 10.4). 


3.3 THE CALCULATION OF ENTROPY CHANGES IN PRINCIPLE 

The calculation of the change of entropy of a system between two states A and 
B is straightforward in principle. 


AS 


5(B) - 5(A) = f B 

J A T 


where the integral is independent of the reversible path since S is a function of 
state. 
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In particular, since the entropy is a function of state 


§ 


c!<2r 


= 0 


The entropy was first introduced in terms of dS (equation 3.6) so there is no 
meaning for absolute entropy in thermodynamics but the change in entropy 
5(B) — 5(A) is well defined and independent of the path joining A and B. 

The simplest entropy calculation is for a process occurring at constant 
volume and temperature. A thermal reservoir (section 2.7) at temperature T R 
for example can gain or lose a quantity of heat Q without change of temperature. 
The entropy change of the reservoir is then simply 


AS = SL 
T r 


(3.25) 


Similarly a solid at its melting temperature (7j„) requires a further quantity of 
heat (the latent heat 0 to convert it to liquid and 


AS = 


/ 


(3.26) 


In general the entropy must be considered as a function of two variables (for 
a simple system) say volume and temperature (as in equation 3.19) 

S(V b ,T b )-S(V a ,T a )= T b f B dV 

T a T j F a \dTjv 

(3.27) 


When C v is a function of V it is necessary to integrate the first term on the 
right-hand side of the equation at constant volume (say V A ) and then to evalu¬ 
ate ( dP/dT) v at temperature T B . The result for AS would of course be just the 
same if C v were to be considered at volume V B and ( dP/dT) v evaluated at T A 
since the entropy change is independent of the path (figure 3.1). 

In the particularly simple case of a system where C v is independent of the 
volume, such as a perfect or van der Waals gas, the integration is more straight¬ 
forward. The specific heat of a monatomic perfect gas is independent of volume 
and temperature so C v may be removed from the integral and 
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v 

14 


Vb 


Ta T b t 

Figure 3.1 Two possible paths for the integration of equation 3.27 when Cy is a function of 
both temperature and volume. 

on substitution of R/V for ( dP/dT) v in the second integral of equation 3.27. 
The behaviour of this equation as T A goes to zero indicates that a classical 
perfect gas cannot exist at absolute zero if the entropy is to remain finite. This 
is discussed in section 4.4. Further examples of entropy changes for particular 
processes will be considered in section 3.5 and in the exercises at the end of the 
chapter but in this section we shall concentrate on the principles involved in 
such calculations. 

The entropy change of a system between the states of thermodynamic 
equilibrium A and B, 5(B) — 5(A), is independent of the process by which the 
system goes from A to B, whether by any reversible path or by an irreversible 
process, since 5 is a function of state. The calculation of the entropy change in 
either case is made by finding any convenient reversible path which joins A to B. 

As an example of an irreversible process, consider an isolated volume of a 
perfect gas (figure 3.2) separated from a vacuum by an adiathermal wall. When 
the wall is removed the gas will flow turbulently (irreversibly) into the vacuum 
and after some time come to a new equilibrium state. Since C/is a function only 
of T for a perfect gas, no net work was performed and no heat transferred, the 
temperature of the gas is unchanged. The initial state is (T, V A ) and the final 
state ( T , V B ). The intermediate states are not defined since the system was not 
in thermal equilibrium. The entropy change of the gas may be calculated by 
finding a reversible process which connects the two states. The simplest such 
process is a reversible isothermal change from V A to V B (figure 3.2). Then from 
equation 3.28 

S(V*,T) - S(V A , T) - R ln^ 



(3.29) 
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Figure 3.2 (a) A perfect gas in thermal equilibrium with volume V\ and temperature T. 

(b) The gas flows via non - equilibrium states into the vacuum, (c) The final equilibrium 
state. The irreversible process is replaced by any reversible process joining A to B in 
order to calculate the change of entropy. 


The entropy of the system has increased after the irreversible expansion. Now Q 
was zero in this process, since the entire system was surrounded by adiabatic 
walls, but the entropy has increased. The equation 



for reversible processes must therefore be extended to read 



(3.30) 


where the equality sign is restricted to reversible processes. In an irreversible 
process the temperature T in equation 3.30 is the temperature of the thermal 
reservoir which supplies heat to the system, since the temperature of a non¬ 
equilibrium system is not defined. The increase in the entropy of a system due 
to an irreversible process will now be considered. (The student could usefully 
work exercise 3.2 before reading further.) 
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3.4 PRINCIPLE OF INCREASE OF ENTROPY 

The statement of the second law of thermodynamics given in section 3.1 ‘in the 
neighbourhood of any state A of a system within adiabatic walls there are states 
B inaccessible from A’ was stated to be true for both reversible and irreversible 
processes. A consideration of the first law of thermodynamics for reversible 
processes then led to the identification of a function of state, the entropy S , 
such that 

d 

T 

In any reversible adiabatic process cTQr is zero and hence the entropy of the 
system is constant. The system therefore may be considered to move on the 
surface defined by 

= constant 

in a reversible adiabatic process. In a simple system with only two independent 
variables the surface reduces to a line in the T-V space given for example by the 
equation TV^~ 1 ^ = constant for a perfect gas (equation 3.45). A reversible 
adiabatic process is sometimes called an isentropic process since it takes place 
at constant entropy. 

The essential point in the argument for an irreversible process is to note that 
A must be one of the end points of the region of all those states which are 
accessible from A or else there will be no neighbouring state B which is inaccess¬ 
ible as required by the second law. This process is shown schematically in figure 
3.3. The entropy of a system after an adiabatic change can therefore only stay 
constant or change in one direction. Since in section 3.3 the irreversible expan¬ 
sion of a gas was seen to lead to an increase in the entropy it is clear that the 
entropy must always stay constant or increase in an adiabatic process. The 
condition that the entropy increase (rather than decrease) in fact follows from 
the definition of the thermodynamic temperature as a positive quantity. 

The principle of the increase of entropy may be stated in the form ‘the final 
state (B) of a system within adiabatic walls is never less than the initial state 
(A)’ 


5(B) - 5(A) > 0 (adiabatic) 


(3.31) 


The problem of the direction of heat flow discussed at the beginning of chapter 
3 is now solved. The bodies 1 and 2 exchange heat such that Q\ + Q 2 = 0 
(equation 3.2) and we take 7) to be slightly greater than T 2 . Then 


AS 


^ + &=Q 2 (±-± 

r, t 2 \t 2 7\ 


(3.32) 
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x 

(c) 


Figure 3.3 A schematic illustration of the second law of thermodynamics, (a) In a reversible 
adiabatic process the entropy remains constant (an isentropic process). The point B is 
inaccessible from A. (b) In an irreversible adiabatic process the entropy can only change 
in one direction (increase). The point B is therefore again inaccessible, (c) If the region 
accessible from A included both points of increased and decreased entropy a combina¬ 
tion of irreversible and reversible processes would make all points B accessible. The 
second law of thermodynamics would therefore not be satisfied. 


The requirement that AS > 0 then leads to Q 2 > 0. The cooler body gains heat 
from the warmer body. Since AS increases in equation 3.32, heat flow across a 
finite temperature difference is an irreversible process. 

If the requirement that 7 \ be close to T 2 is relaxed then the entropy change 
must be calculated by finding a reversible path joining initial and final states. 
The reversible path is formed by placing each body separately in contact with a 
series of thermal reservoirs whose temperatures vary continuously between the 
initial and final temperatures. In the most general case of bodies 1 and 2 with 
heat capacities C Vl and Cy 2 (both functions of temperature) and initial tem¬ 
peratures 7\ A and T 2 a (T iA > T 2 A ) the final temperature T B is given by the 
calorimeter equation 

j^ B A c vi d T = - C V2 d T (3.33) 

and the total entropy change by 


'Tb 

Cvi dr ___ j 

•Tb 

Cy 2 d T 

(3.34) 

Tx A 

T ' 

] T 2 a 

T 

•Tb 

Tx a 

Cy\ d (In T) 

♦J 

\ fj 1 

B Cvi d (In 0 
'2A 

(3.35) 


These equations are represented in figure 3.4. AS is always positive. In the 
particularly simple case C Vl = Cy 2 = Cy independent of T , 7b = (7iA + T 2 a)I2, 
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the integration is straightforward 

AS = C v (in— + In —^ 
V T iA T 2 aJ 





Figure 3.4 (a) The final equilibrium temperature of two bodies is given by equal areas under 
the graph of heat capacity against temperature (equation 3.33). (In the figure the heat 
capacities hav6 been taken to be independent of temperature for simplicity.) (b) The 
total entropy change of the two bodies is given by the difference between the areas abed 
and defg when ClT is plotted against the absolute temperature (T) or (c) when C is 
plotted against In T. 
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which is positive definite (remember that the first term is negative since 
7 ia > 7b > T 2A ). In the limit T iA = (T 2A + A), the reader may prove using 
In (1 ± y) ± y -y 1 /2 for small y that 

AS = + (A-* 0 ) (3.36) 

4T b 2 

3.5 THE ENTROPY OF A PERFECT GAS 

The change of entropy of a perfect gas from the initial state (V A , T A ) to the 
final state ( V }) , 7b) was shown to be 

AS = S(F b ,7b)-S(F a ,7a) 

(^\ iV 

Jta t j k a Ur/ V 

(equation 3.27). The thermal capacity of a monatomic gas is independent of 
temperature and therefore (equation 3.28) 

= (3.37) 

The perfect gas is so important in thermal physics — because of its particularly 
simple equation of state (PV = RT) and the fact that the internal energy is 
independent of the volume — that it is worth considering equation 3.37 in more 
detail. It has already been emphasised that since the entropy was introduced by 
a differential (d S = cTQ r/T) there is no absolute value of the entropy, only 
entropy changes are completely defined. If the state (F A , 7a) were to be taken 
as a standard reference state however, equation 3.37 might be written 

S(V , T) = C v lnT + RlnV + S 0 (3.38) 

So = %J) - C K lnT A - R lnV A 

Equation 3.38 is of course correct and is often found in textbooks, but it is 
rather misleading. The appearance of In T and In Kis mathematical nonsense 
unless the definition of S 0 is remembered, but more seriously the entropy as 
defined by equation 3.38 is non-extensive. 

This becomes obvious if the heat capacity Cy is replaced by the specific heat 
per molecule cy and the number of molecules N and the gas constant replaced 
by Nk where k is the Boltzmann constant. 
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S(V , T) = Nc v In T + Aft In V + S 0 (3.39) 

If the volume of the system is divided by two, the first term on the right is also 
halved, since T is constant throughout the volume, but the second term becomes 
( Nk/2)ln(yiX )• The logarithmic terms must therefore both be made intensive if 
the entropy is to be proportional to N. 

This problem can be overcome if the reference state is defined in terms of the 
intensive parameters (P A , T A ). Then using P A V A = NkT A equation 3.39 becomes 

S(V,T) = Nc v \nT + Nk In + Ns o (3.40) 

Ns 0 = S(V a ,T a ) - Nc v \nT A - Aft In 

= S(V A , 7\) - Nc v In T A - Nk In 

Now N/V, the number of particles per unit volume, is constant throughout the 
system since gravitational forces have not been considered, so n = N/V is an 
intensive quantity. The entropy is therefore properly extensive (proportional to 
AO in equation 3.40. 

The difference between equations 3.39 and 3.40 may seem to be rather 
trivial, but a great deal of confusion can arise if calculations are performed in a 
manner which does not make the extensive nature of the entropy obvious. Con¬ 
sider for example two equal volumes of a perfect gas at the same temperature 
and pressure in adiabatic enclosures [figure 3.5(a)]. What is the change in the 
entropy of the system if the partition between the two volumes of gas is with¬ 
drawn reversibly? The answer is clearly zero since nothing has changed in the 
gas after the removal of the partition and this is confirmed immediately by 
equation 3.40. The temperature of the two volumes of gas ( T ) is unchanged and 
if the number of molecules in each container was N/2 and the volume V then the 
original entropies of the two volumes may be written 

o o N f ^ N, f (V\ N 

<Sai = $A 2 = - cylnT + -kin [—) + -- s 0 

2 2 \N J 2 

the final entropy 

S B = Nc v In T + Nk In + Ns 0 


(~ A ) 

WJ 

© <“» 


and the entropy change 


AS - Sb — (S A i + S A2 ) - 0 


(3.42) 




Figure 3.5 (a) The entropy of two equal volumes of a single gas at the same temperature 
and pressure is unchanged after the partition is removed from within the adiabatic 
enclosure (b), because the number density of the molecules is unchanged, (c) The 
entropy increases if two different gases are mixed (d). 


The partition could of course equally well be inserted reversibly into the large 
volume of figure 3.5(b) again without any change in the entropy. The important 
parameter is the number density of the molecules ( N/V ) not the number or the 
volume separately, as is clearly shown by equation 3.40. A further discussion of 
this experiment, (from the microscopic point of view) is given in section 7.5. 
The important point that then becomes clear is that the entropy change is zero 
because the molecules of the gas are indistinguishable, but this point has already 
really been established by the above equations which are concerned only with 
(N/V) and not with which particular molecules of the gas make up N in any 
given V. 

When two different perfect gases are contained in equal volumes at the same 
temperature and pressure within adiabatic walls [figure 3.5(c)] the entropy does 
increase after the partition has been removed since ( N/V ) for each type of 
molecule decreases by a factor 2. The irreversible mixing of the two gases has to 
be replaced by a reversible process in order to calculate the entropy change. It 
was first shown by Gibbs that the entropy change could be calculated by con¬ 
sidering the movement across the total volume of a membrane which allowed 
molecules say of gas 1 but not of gas 2 to pass through it. The mixture shown 
in figure 3.5(d) could therefore be separated as in figure 3.5(c). The over-all 
effect however is simply equivalent to the free expansion of each gas from a 
volume Fto a volume 2V . The molecules of a perfect gas essentially ignore each 
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other. Then writing the number of molecules of type 1 as N t 


5ai = N x cy In T + N x k\n ^ — I + N x Sq 


S A2 = N 2 cvlnT + N 2 k In / — ) + N 2 s 0 

\n 2 J 


S B = (TV, + N 2 )c v lnT + W,*ln 


+ N 2 k In J + (TV, + JV 2 )s 0 . 

Since the number of molecules of type 1 is equal to the number of type 2 the 
entropy change may be written using lAf x I = lAj I = N/2 

AS = S h - (S A1 + S A2 ) = (N t + N 2 )k\nl = Nk\rx2 

(3.43) 


The entropy of mixing of two gases is therefore positive (an irreversible process) 
as expected since reinserting the partition in figure 3.5(d) does not lead to the 
original state of the system. 


3.6 ADIABATIC EQUATION FOR A PERFECT GAS 

The equation of the path of a perfect gas in a reversible adiabatic process (a 
process at constant entropy or an isentropic process) may be calculated from the 
general form of the second law of thermodynamics for a perfect gas. 

TdS = C v dT + PdV 

In an isentropic process dS 1 is zero and 


Cy dT = — P dV = — 


RT 

V 


dV 


Cv dT 
T 



dV 

V 


When Cy is independent of temperature (a monatomic gas) the equation becomes 
simply 


f = R f dF 

J T Cy J V 
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or 


TV r I°v = constant (3*44) 

Since R = (C P - C v ) (equation 2.35), equation 3.44 becomes 

jyi 7-0 = constant (3.45) 

where y ~ Cp/Cy . Since all the states of thermal equilibrium of the gas are 
related by the equation PV/T = constant, equation 3.45 may also be written 

PV y = constant (3.46) 


or 


p(i 7 ) ji - constant (3.47) 

These equations are required in the study of the Carnot cycle for a perfect gas 
(section 3.7) and are also important since a measurement of y provides some 
evidence for the number of atoms in a molecule of a gas (section 7.5). 


3.7 THE CARNOT THEOREMS FOR HEAT ENGINES 


A heat engine was introduced in section 2.7 as a machine which after each 
complete cycle has converted some quantity of heat into useful work. The 
maximum efficiency of such a machine can now be established using the state¬ 
ment that the entropy of a system within adiabatic walls cannot decrease (sec¬ 
tion 3.4). Consider first a heat engine working between two large thermal 
reservoirs at temperatures T A and T B ( T A > T B ). Then if the engine is to be 
reversible all heat must be taken in at T A and rejected at 7 b to avoid heat trans¬ 
fer across a temperature gradient. A cycle therefore consists of two isothermals 
at T a and T B joined by two reversible adiabatic changes (figure 3.6). The 
efficiency of the engine is given by equation 2.39 as 


= 2 a _1_2b 

Qb 


(3.48) 


where Q A (Q B ) is the quantity of heat absorbed (rejected) by the engine at 
temperature 7a(7b). After a complete cycle, A U is zero, so the overall entropy 
change for the system of engine plus reservoirs is 


AS = - 


Qa 


+ 


Qb 

T b 



Second Law of Thermodynamics 


63 



Figure 3.6 The only reversible cycle for an engine working between two infinite heat reservoirs 
at T A and 7 b consists of a reversible isothermal expansion at T A> a reversible adiabatic 
expansion to temperature 7 b, an isothermal contraction at 7 b and a final adiabatic con¬ 
traction. In this way no heat is transferred across a finite temperature difference. The 
shape of the P- V diagram will depend upon the working substance but the efficiency of 
the engine is independent of the working substance. 


In a reversible process AS is zero and 


— = — (reversible) 
Gb 


(3.49) 


Hence the efficiency of a reversible engine working between maximum tempera¬ 
ture T a and minimum temperature T B is 


n —’ B = lAJzIi (reversible) 

Qa t b 


In general AS > 0 and therefore 


Qa < Ta 

2b t b 

. . T B 
r? < 1- 


(3.50) 


(3.51) 


where the equality is correct for a reversible engine. A reversible engine is there¬ 
fore the most efficient possible engine working between a maximum temperature 
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T a and a minimum temperature T B but always has an efficiency of less than 
unity since T A > 0 K (section 4.4). 

The concept of a reversible heat engine was introduced by Carnot (1824) and 
from the Carnot theorems Kelvin and Clausius first formulated the second law of 
thermodynamics (section 3.8). The historical order has been inverted in this 
book because the second law of thermodynamics applies to all systems and the 
Carnot theorems may then be treated as special applications of the law to heat 
engines. 

The Carnot theorems may be written: 

(1) All reversible engines working between given heat sources have the 
same efficiency, independent of the nature of the working substance. 

(2) A reversible engine is always more efficient than an irreversible engine 
working between the same source and sink. 

The first theorem follows immediately since no assumption about the work¬ 
ing substance (perfect gas, real gas, vapour, etc.) was made in deriving equation 
3.50 and the second theorem follows from equation 3.51. 

A reversible heat engine is often called a Carnot engine and a reversible cycle 
of two isotherms and two adiabats a Carnot cycle. This terminology has however 
led to some confusion and the term reversible heat engine will be. used in this 
book. When the engine works between only two reservoirs, the only possible 
reversible engine is one which follows a Carnot cycle because the engine must 
absorb heat at T A and reject it at T B to prevent a heat transfer across a finite 
temperature gradient. A graph of temperature against entropy for a Carnot cycle 
is simply a rectangle independent of the nature of the working substance since 
two changes occur at constant temperature and two at constant entropy 
(figure 3.7). A Carnot cycle can also be illustrated on a P-V diagram (figure 3.6) 
but the shape of the cycle is then a function of the working substance. A direct 
verification of equation 3.50 may be obtained by considering a perfect gas as the 
working substance (exercise 3.4). 

When a set of temperature reservoirs are available at temperatures between 
T a and 7 b many reversible cycles become possible, all of course with efficiency 
given by equation 3.50. The Carnot cycle is in fact not of any practical import¬ 
ance because the work done per cycle (the area enclosed by the cycle on a P- V 
diagram) is very small. An important practical cycle is due to Stirling. The Carnot 
and Stirling cycles, using a perfect gas as the working substance, are shown in 
figure 3.8. The area of the Stirling cycle is obviously much greater than that of 
the Carnot cycle under the same limiting conditions (exercise 3.4) but the 
efficiencies are identical since the Stirling engine absorbs more heat along the 
isotherm T A than does the Carnot engine. The Stirling engine rejects heat to a 
thermal source of variable temperature along be such that the working substance 
and source are at the same temperature and absorbs an equal amount of heat 
from the source along da. The cycle is therefore reversible since heat is never 
transferred across a finite temperature gradient. 

The simple equation for the efficiency of a heat engine (3.51) is one of the 
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Figure 3.7 The entropy - temperature diagram of a reversible heat engine working between 
two thermal reservoirs at T\ and 7 b is always a rectangle since the temperature is con¬ 
stant for two parts of the cycle and the entropy for the other two. The area of the 
rectangle gives the net heat absorbed in the cycle (exercise 3.6). 



Figure 3.8 When more than two thermal reservoirs are available many reversible cycles 
become possible. The Stirling cycle is of great practical importance because the work 
done per cycle is greater than that of the Carnot cycle (shown as a broken line) under 
the same conditions. The heat absorbed along da in the Stirling cycle is rejected along be. 
The heat is always transferred to a reservoir at the same temperature as the gas. (See also 
exercise 3.4). 
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most remarkable in all science. An engine designer will normally have the 
lowest temperature fixed by the environment of the engine, say river water or 
the atmosphere, and then has only two variables available, the highest tempera¬ 
ture which the materials of the engine will withstand (the metallurgical limit) 
and the degree of irreversibility of the engine due to friction, turbulence in the 
working substance and heat transfer across a finite temperature difference. 

All real engines are irreversible with efficiencies at best about 40 per cent of a 
reversible engine working under the same conditions. The approximate P-V 
diagram for a real Stirling engine is illustrated in figure 3.9. 



Figure 3.9 An approximate P- V diagram for a real Stirling engine. The efficiency is found 
to be about 40 per cent of the theoretical maximum efficiency. 


A refrigerator is a machine which, acting in a cycle, extracts heat from a cold 
body and rejects heat at a higher temperature (figure 3.10). It is of course 
necessary to do work on the refrigerator with a separate engine. The work 
required is 


W - Qa ~ Qb 

where Qa is the heat rejected to the hot reservoir and Qb the heat absorbed 
from the cold reservoir. The coefficient of performance of a refrigerator (CP) is 
defined 


heat extracted from cold reservoir 

CP =- 

work done on the system 

= Qb _ 

Qa ~ Qb (3.52) 
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Figure 3.10 A refrigerator takes heat from a cold body (B) to a hot body (A) with the aid of an 
external engine. 


In the case of a reversible refrigerator, equation 3.49 may be used to give 

CP = ——- (3.53) 

T k - 7 b 

Notice that the coefficient of performance in equation 3.53 is greater than unity. 
A reversible engine has of course the highest possible coefficient of performance 
for a given (7X, 7 b) as may be proved by the same argument by which the 
reversible heat engine was shown to be most efficient. A refrigerator working in 
the Stirling cycle (driven by an electric motor) has become of great importance 
in the liquefaction of gases (section 5.3). 

A heat pump is defined as a machine, working in a cycle, which provides use¬ 
ful heating (say of a room) by extracting heat from a thermal reservoir (outside 
air) at a lower temperature. The process therefore looks like that of the refrigera¬ 
tor shown in figure 3.10 but it is the heat given to the high-temperature reservoir 
which is now of interest. The coefficient of performance for a heat pump is 

heat rejected to hot reservoir _ Q A 

work done on the system g A - Q B 

= 1 + CP (3.54) 

(using equation 3.52 for a refrigerator). A reversible engine is therefore as usual 
most efficient since CP is a maximum for a reversible engine. 

Lord Kelvin first noticed that a reversible heat pump was the most efficient 
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possible method of converting energy (say electricity) into heat but although 
some large buildings have been heated by heat pumps the capital cost of the 
installation has usually more than balanced the saving in running expenses over 
more conventional heating systems. 


3.8 HISTORY OF THERMODYNAMICS 

The view of thermodynamics which has been presented in this chapter and the 
previous one has taken little account of the historical development of the sub¬ 
ject. A scientific theory very rarely develops as straightforwardly as it is presented 
in a textbook, but the history of thermodynamics is perhaps the most compli¬ 
cated of any branch of physics. Although no attempt will be made to cover the 
historical background in detail, it is of some interest to see with the advantage 
of hindsight exactly where the confusion arose that so hindered the development 
of thermodynamics. 

In chapter 2 heat was treated as a derived quantity such that the change in 
the internal energy between two states was always given by 

U( B) - U( A) = Q + W 

The internal energy was recognised as a function of state and W , and therefore 
Q , were shown not to be functions of state. 

Although the nature of the energy transfer due to thermal processes does not 
therefore concern us in the present chapter, it is useful to think of heat on a 
microscopic picture (and therefore outside the realm of thermodynamics) as the 
energy of the system due to the random motion of the molecules (see chapter 8). 
The conversion of heat into work therefore involves, for example, the increase 
in the potential energy of a piston (ordered motion) at the expense of the ran¬ 
dom motion of the molecules. It is therefore not surprising that heat cannot be 
converted to work in a cyclic process with an efficiency of unity (equation 3.50). 

In the early nineteenth century however, heat was considered to consist of 
atoms (caloric) and to be quite distinct from other forms of energy. The con¬ 
servation of caloric therefore required Q to be a function of state and it is 
then possible to derive for example the adiabatic equation for a perfect gas 
PV y = constant (equation 3.46). 

Carnot (1796 -1831) used the caloric theory in 1824 to produce his two 
theorems on the efficiency of heat engines which were in time recognised to be 
of fundamental importance, and it was not until 1850 that Clausius (1822 - 88) 
and William Thomson (later Lord Kelvin) (1824 -1907) realised that in fact the 
Carnot theorems did not depend upon the caloric theory. The very careful 
measurement of the conversion of many different kinds of work into heat by 
Joule (1818-89) in the period 1843 - 8 finally established heat as a form of 
energy transfer. 

Qausius first identified the internal energy and the entropy (rather than 
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quantity of heat) as functions of state and wrote the equation d S >8Q/T . With 
Kelvin he seems to have been the first to state explicitly that two laws of thermo¬ 
dynamics were required. Caratheodory gave the second law of thermodynamics 
in the form used in section 3.1 in 1909. 

Two interesting accounts of the early work in thermodynamics are: Mendoza 
(1961) and Klein (1974). 

3.9 CONCLUSIONS 

The establishment of the second law of thermodynamics and of the entropy as a 
function of state has enormously increased the range of problems that can be 
treated by thermodynamics beyond those that could be handled using only the 
first law. The establishment of the entropy as a function of state was in itself 
sufficient to lead to the derivation of the energy equation (equation 3.14) and a 
consideration of the increase of entropy in irreversible processes then led to the 
correct prediction of the direction of heat flow and the maximum efficiency of 
a heat engine under given conditions. 

The entropy function is not however a very convenient function to work with. 
The statement, ‘in a system within adiabatic walls the entropy cannot decrease’, 
may be applied to any system by enclosing it within a suitable space but most 
experimental science is carried out at constant temperature and pressure. The 
entropy statement then requires the system of thermal reservoir (defining the 
temperature) and experimental system to be treated as a whole. An alternative 
function of state which concerned only the experimental system at temperature 
T would clearly be more directly useful than the entropy. Functions of this type 
(called thermodynamic potentials) are discussed in the next chapter. 


EXERCISES 

3.1 Using the equations of state given in exercise 2.4 find expressions for (a) the 
internal energy and (b) C P - C v . (c) Show that C v is independent of the volume 
for all the equations of state, (d) Find the equations for a reversible adiabatic 
change. 

3.2 Find expressions for the changes of entropy of the system in exercises 
2.5-2.8. 

3.3 Verify equation 3.50 by using (a) a perfect monatomic gas and (b) a van der 
Waals gas as the working substance in a Carnot cycle. 

3.4 An engine contains one mole of a monatomic perfect gas. Show that the 
efficiency of the engine is the same if it is worked in either a Stirling or a Carnot 
cycle between the same maximum and minimum pressures, temperatures and 
volumes but that the work done in the Stirling cycle is greater than that in the 
Carnot cycle. 
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3.5 Show that the area enclosed on an entropy - temperature diagram by a 
reversible engine working in a cycle is equal to the net heat absorbed. 

3.6 Construct a P-V diagram for a Carnot cycle if the working substance has 
the equation of state P = b T 4 = u \3 where U=uV. (Show that the equation for 
an isentropic change is T 3 V = constant.) Show that equation 3.49 still applies. 
(The equations are those for black-body radiation. See section 10.2.) 

3.7(a) A block of constant heat capacity 500 J K -1 is heated to 600 K and then 
placed in a lake at 300 K. Find the change in the entropy of the system of block 
plus lake, (b) The block is reheated to 600 K, placed in boiling water and finally 
replaced in the lake. Find the new change of entropy and explain how the 
change could be reduced to zero. 



4. Further Concepts of 
Thermodynamics 


In chapter 3 the development of the second law of thermodynamics and of the 
entropy as a function of state led to the derivation of equations such as 3.24 
which relate measurable parameters of any system. Thermodynamics is not a 
complete self-contained description of the thermal properties however since to 
obtain the individual quantities, such as the coefficient of thermal expansion of 
the system, a certain amount of information has to be provided either from 
experiment or else from microscopic theory (statistical mechanics, see chapter 7). 
The minimum amount of information which is required before all the parameters 
of a system can be calculated using the laws of thermodynamics is therefore of 
some importance. 

In a simple system only two parameters are required to define the state of the 
system and five variables (T, S , U , P, V) have so far been considered. An equation 
of the form /(x,y), where x and y are chosen from the above five variables, from 
which all the thermodynamic properties of the system can be calculated without 
performing an integration is called a fundamental equation. As has already been 
stressed this equation must be obtained from experimental or statistical mech¬ 
anics. Integration is not an acceptable operation because it would introduce an 
arbitrary constant into the thermodynamic quantities. 


4.1 THE FUNDAMENTAL EQUATIONS 

The fundamental equation for a simple system may be written in a number of 
ways depending upon the choice of the variables x and y. The internal energy 
expressed as a function of entropy and volume, U(S, V ) is a fundamental 
equation. The quantities S and V are called natural variables for U. A knowledge 
of U(x,y) where x andy are not equal to S and V is not a fundamental equation 
since some thermodynamic quantities will then be defined only to within a con¬ 
stant of integration. 
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As a simple example suppose that the equation for the internal energy of a 
monatomic perfect gas as a function of T and V 

U = ^NkT 

has been obtained. Then C v = 3/2 Nk so 

S = J* — dT = jNk In T + constant 

and S is defined only to within the integration constant. If U(S , V) is known 
then the equation of state and all other properties can be calculated without 
performing an integration (exercise 4.1). Similarly S(U, V) is a fundamental 
equation. 

The two fundamental equations U(S, V ) and S(U , V) have natural variables 
which are difficult to evaluate by experiment. The purely mathematical problem 
is to transform, say U(S, V ) to a new fundamental equation with more conven¬ 
ient natural variables. A change of variables of this type is called a Legendre 
transformation and the new function is called a thermodynamic potential. 

The most important thermodynamic potentials for simple systems are 
tabulated in table 4.1 along with their natural variables and differential form. The 

TABLE 4.1 


The definitions and form of the differentials of the enthalpy 
and Gibbs and Helmholtz free energies. 


Name 

Symbol 

Natural 

Variables 

Definition 

Differential 

Helmholtz free 
energy 

F 

(T, V) 

U-TS 

d U - TdS - S dT = 
-PdV-S dT 

Gibbs free energy 

G 

( T,P ) 

F + PV 

dF + PdV+ V dP = 

— S dT + V dP 

Enthalpy 

H 

iS.P) 

U + PV 

dU + P dV + V dP = 

T dS + V dP 


Helmholtz free energy (F) is so called because for a system at constant tempera¬ 
ture, that is to say in thermal contact with a heat reservoir at temperature T, the 
reversible work done in a change of volume from V A to V B is simply 

j^ B PdF = F(B) - F( A) 


That is, the work done in a reversible isothermal process is equal to the decrease 
in the Helmholtz free energy of the system. 
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Notice that dFis zero (Fis constant) for a reversible process at constant 
volume and temperature. The Helmholtz free energy F(P, V ) is perhaps the most 
important of all the thermodynamic potentials because it is the quantity which 
can be calculated most directly by the methods of statistical mechanics (see 
chapter 7). 

Some examples will now be given of the kind of manipulation of a funda¬ 
mental equation required to obtain other thermodynamic quantities. The 
definitions 


F = U - TS 

(4.1) 

dF = - PdV - SdT 

(4.2) 

, (£) a, ♦ (•?) dr 

(4.3) 

\WJt \dT )v 



lead immediately to the identities 



Equation 4.6 is often called the Gibbs - Helmholtz equation. Notice that P, S and 
U are all expressed in terms of F or of differentials of F as required if F(P, V) is 
to be a fundamental equation. 

The specific heat at constant volume also follows immediately from 

- ■ ■ ia ■ - «•> 

Some other relations may be found in exercise 4.2. 

The fact that F is a function of state may be applied (in exactly the same 
way as in equation 3.12) to equation 4.2 to obtain the relation 

(—) = (—) ( 4 . 8 ) 

\bTjv va vJt 

In fact equation 4.8 was obtained by a more indirect method in equation 3.18. 

The Helmholtz free energy has been discussed in some detail because of its 
great importance in theoretical physics. The Gibbs free energy with natural 
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variables (T, P) is of most importance in chemistry since 

dG = VdP - SdT (4.9) 

and therefore a reversible process occurring at constant temperature and 
pressure has dG equal to zero and G constant. The most important application 
of the Gibbs free energy in physics is to changes of phase such as the solid - liquid 
transition (section 10.4). 

The enthalpy is of greatest use in chemical engineering because of its import¬ 
ance in flow processes such as the throttling process discussed in section 5.3. The 
equation 


d H = TdS + VdP = 6Q r + VdP (4.10) 

shows that for a reversible process at constant pressure the change in the 
enthalpy is equal to the quantity of heat transferred. 

■(?),-© 

The fundamental equations have been defined as equations from which all 
other functions of the system can be calculated without performing an integra¬ 
tion. If integration is permitted, that is to say the definition of some terms only 
to within a constant is acceptable, then only the function of state /(P, V , T ) and 
the heat capacity of the system as a function of temperature at one given 
pressure or volume are required. If say C(K A , T ) was known from experiment 
then the entropy could be calculated relative to some reference state ( V A , r A ) 
using equation 3.27 and the volume dependence of the specific heat is found 
from the equation of state using equation 3.20. 


4.2 THE MAXWELL RELATIONS 


The equality between (9 P/dT) v and (3 S/dV) T derived in equation 4.8 using the 
fact that F was a function of state and dF a perfect differential, is one example 
of a set of equations of great practical importance known as the Maxwell equa¬ 
tions. Using equations 4.8,4.9 and 4.10 and the second law of thermodynamics 
in the form dU= TdS - P dV, the four Maxwell relations for simple systems are 



(4.12) 


(4.13) 
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/ar\ = /3F\ 

WJ? ~ W> 

/ar\ _ / 3P\ 

\3F/s ~ V ds jv 


The equations 4.12-4.15 are not of course independent, any of them can be 
manipulated by using the relations of functions of two variables (appendix I) to 
give the other three. The importance of equation 4.12 was discussed in section 
3.2. The left-hand side of equation 4.13 may also be calculated directly from 
the equation of state and be used to replace the quantity — (dS/dP) T in any 
equation in which it occurs. 


4.3 THERMODYNAMIC EQUILIBRIUM 

The second law of thermodynamics was shown in chapter 3 to lead to the con¬ 
clusion that the entropy of a system within adiabatic walls could not decrease. 

In thermal equilibrium the entropy of the system is constant at its maximum 
value for a given internal energy and volume. Consider two subsystems of con¬ 
stant volume within the adiabatic enclosure joined by a diathermal wall. Then 

U = U x + U 2 d U = dU, + dU 2 = 0 (4.16) 


since energy is conserved, and after thermal equilibrium has been established 
between the two subsystems the entropy is a maximum, so 


d S = dSj + dS 2 = 0 

Now 


d S 


'dS i 
.3 Ui 


V, 


d Ui 


+ 




1 

T 2 


dU, 


(4.17) 


using the definition of the absolute temperature from the second law of thermo¬ 
dynamics. The equilibrium condition (equation 4.17) therefore requires that 
T\ = T 2 for thermal equilibrium between the subsystems. In this sense the 
zeroth law of thermodynamics (section 2.1) is not a necessary postulate for the 
construction of thermodynamic^, il i| contained within the second law. The 
equilibrium state has not been shown to be stable since only the fact that 
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T 


Figure 4.1 A system (S) of fixed volume in thermal equilibrium at temperature T may be 
treated either by considering it as attached to a thermal reservoir (R) within adiabatic 
walls, (the total internal energy is then constant and the entropy a maximum) or as a 
system with defined temperature T and volume V for which the Helmholtz free energy 
is a minimum. 


dS = 0 has been used so far. The maximum condition on the entropy also 
requires d 2 S < 0 but this will not be proved here. 

The equilibrium condition for a system in contact with a thermal reservoir 
at temperature T R can also be written in terms of the entropy maximum of the 
combination of system and reservoir enclosed within adiabatic walls (figure 4.1) 
but it is obviously more convenient to express the equilibrium condition in 
terms only of the parameters of the system of interest and the temperature of 
the reservoir (7 r). The enclosed combination of thermal reservoir and system of 
interest has, by analogy with equations 4.16 and 4.17 

d (U + C/ R ) = 0 d(S + S R ) = 0 (4.18) 

subject now also to the condition that the temperature always be T R . Now at 
constant volume dU R = T R dS R and therefore using equation 4.18 

dU + 7r diS*R — dU — T R dS = 0 


Therefore 


d{U - T r S) = 0 (4.19) 

since 7 r is constant. The condition for thermal equilibrium for a system at 
constant volume in contact with a thermal reservoir is therefore given by equa¬ 
tion 4.19, which is equivalent to dF = 0 where F = U - TS is the Helmholtz free 
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energy. A consideration of d 2 F shows that F must be a minimum. Similarly for 
a system in contact with a thermal reservoir and maintained at constant pressure 
the Gibbs free energy is a minimum (exercise 4 A). 

An example may clarify the concept of the Helmholtz free energy being a 
minimum subject to the conditions that T and Fbe constant. In exercise 6.6 it 
is shown that a perfect crystal, a regular array of atoms, cannot exist at tempera¬ 
tures above absolute zero. If n atoms are removed from their sites, involving an 
energy expenditure of ne then the Helmholtz free energy is 

F = ne - kT[N In N - (N-n) In (N-n) ~nlnn] (4.20) 

where N is the total number of sites in the perfect crystal. The number of 
defects ( n ) of a crystal in thermal equilibrium at temperature T is now found by 
setting ( dF/dn) T equal to zero in equation 4.20. 


4.4 THIRD LAW OF THERMODYNAMICS 


The first and second laws of thermodynamics which have been developed in 
earlier sections are sufficient to show that the thermodynamic temperature can 
never be negative (once the temperature at the triple point of water was defined 
to be positive) although it is possible to describe the behaviour of certain sub¬ 
systems by a negative temperature under special conditions (section 10.5). The 
lowest thermodynamic temperature is therefore 0 K. The entropy change of a 
system at constant volume cooled from T A to 7 b, given by 


AS 


f 


r B Cy 

T A 


would therefore become infinite if C v were independent of temperature (as was 
predicted by classical physics) and T B were set equal to zero. The experimental 
measurements of the specific heats of metals at low temperatures however are of 
the form 


C v = AT 3 + yT (4.21) 

where A and y aiv, constants (figure 4.2). (See also section 10.3.) Then 
( C V \T ) -» y as T-+ 0 K and the integral remains finite. 

The third law of thermodynamics was originally developed by Nernst for 
chemical changes and like the second law has been expressed in a number of 
different forms. The most useful perhaps from the point of view of physics is: 

‘as the temperature approaches zero the entropy of any system becomes 
independent of the parameters of the system’. 


TU o K S-+s Q 


(4.22) 
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Figure 4.2 The specific heat of a metal (divided by the temperature) at low temperature as a 
function of T 2 . The straight line has the form of equation 4.21. The intercept represents 
the contribution of ‘free’ electrons (section 10.1) and is not present for non-metals. See 
also figure 10.9 


where S 0 is a constant. Since the thermodynamic temperature is equal to 
(dU/dS)v this is equivalent to 



Since S 0 is a constant, the differentials of the entropy tend to zero as T o k. 
Now using equations 4.13 and 3.23 


dS 


bPj T 



(4.23) 


The coefficient of thermal expansion therefore goes to zero as 71. 0 k (figure 
4.3). Similarly using equation 4.12 


dS ' 
dVj T 


dP\ 

dT/v 


(4.24) 


goes to zero (figure 4.4). The isothermal compressibility, l/V(dV/dP) T however 
goes to a constant value not to zero. 

All specific heats certainly go to zero since 
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T 


Figure 4.3 The coefficient of volume expansion goes to zero at low temperature. 



0 10 20 30 

T (mK) 


Figure 4.4 (bP/b Dagoes to zero at low temperature as shown here for 3 He. See also 
section 10.4. (After W. J. Huiskamp and O. V. Lounasmaa Rep. ProgPhys. 36 (1973) 
p. 423). 


and (dS/dT'k must remain finite as T -> o K • According to the experimental 
equation (4.21) however, for a metal 



approaches the constant value y rather than zero as ^0 K- 

In fact on the basis of statistical mechanics (section 7.8), ( dS/bT)v does 
approach zero because equation 4.21 is incorrect at temperatures extremely 
close to absolute zero (« 10“ 10 K) which cannot be reached by experiment. The 
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specific heat then decreases exponentially to zero and (Cy/Tf^Q as 7L+ 0 K . 

The temperatures from which it is safe to extrapolate measurements to absolute 
zero must therefore always be carefully considered. 

A classical perfect gas cannot exist at low temperatures since the specific heat 
of the gas is independent of the temperature and the entropy would diverge as 
the temperature goes to zero (equation 3.28). The quantum-mechanical gases 
discussed in section 10.1 could however in principle continue to exist to the 
lowest temperatures. In fact only the two isotopes of helium remain even as 
liquids (at atmospheric pressure) at the lowest temperatures attainable by 
experiment (see sections 10.1 and 10.4). 

The first and second laws of thermodynamics have both been expressed in 
earlier sections in terms of the impossibility of constructing a perpetual motion 
machine. The third law can also be written in a restrictive manner: ‘it is not 
possible to cool a substance to the temperature T = 0 K.’ The distinction is to 
be made between approaching 0 K as closely as desired and actually attaining 
the mathematical value of zero. This statement is consistent with the earlier one 
that S-*s q as will now be shown. The most efficient method of cooling the 
system is a reversible adiabatic expansion for which dS = 0. Then from equation 
3.15 


TdS = C v dT + t(-) dV = 0 
\BTjv 

The temperature drop in a reversible adiabatic expansion is therefore 



Now since ( dP/dT) v goes to zero as 71> 0 k (equation-4.24) because S the 
temperature T - 0 K cannot be arrived at from any finite temperature T. 

The value of S 0 has been left undefined so far and cannot be evaluated on the 
basis of thermodynamics. In chapter 6 it is shown that the entropy of a system 
in true thermodynamic equilibrium does appear to approach zero as T^ 0 k. This 
statement is however only useful for pure crystals since diffusion processes in 
solids are so slow that an alloy will always be in a metastable state at low 
temperature. Similarly, systems which form glasses or amorphous solids rather 
than perfect crystals will have S-+ So as T^ 0 K but S 0 is not equal to zero. 
Thermodynamics leads only to values for the difference of the entropy of a 
system in two given equilibrium states since the entropy was originally intro¬ 
duced in the form of the differential d S = cTQr/7\ 
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EXERCISES 

4.1 The fundament equation U(S, V ) of a monatomic perfect gas may be 
written 



where a is a constant. Show that the equation of state PV = RT i the principle 
specific heats and the fundamental equation in the form F(T , V) may all be 
derived from this equation. 

4.2 Derive expressions for the isothermal compressibility and the difference 
(Q> - Cy) in terms of F{T , V ). 

4.3 Show that Cp = -T\ ^ ) 

13 T 2 p 


4.4 Show that the equilibrium condition for a system at constant temperature 
and pressure may be written d G = 0. 



5. Further Applications of 
Thermodynamics 


The fundamental equations of thermodynamics have been established in 
chapters 2-4. Three important applications of these equations are discussed 
in this chapter and other applications are considered using both thermodynamics 
and statistical mechanics in chapter 10. 


5.1 REDUCTION OF MEASUREMENTS TO CONSTANT VOLUME 

Theoretical calculations in solid-state physics are always made for systems at 
constant volume but experiments on properties such as the specific heat are 
normally made at constant (atmospheric) pressure. The pressure required to 
maintain a system at constant volume can be calculated by considering the 
volume as a function of temperature and pressure 

At constant volume the left-hand side is zero and 

rp 

A P = P(T b ) - P(r A ) = - f ® 

j T a 

is the increase in pressure required to keep the volume constant as the tempera¬ 
ture increases from T A to 7 b- This equation may be written 



- 1 dT 


/• Tb 

A p = aK T 


( 5 . 1 ) 
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using equations 3.22 and 3.23. When the system is a perfect gas, equation 5.1 
becomes simply 


R P AT 

A P = - (T b - r A ) (5.2) 

V T 

so the fractional increase in the pressure to hold the system at constant volume 
is simply equal to the fractional change in the thermodynamic temperature. 

The pressure required to hold a solid at constant volume depends upon the 
temperature of the system since the coefficient of thermal expansion goes to 
zero at absolute zero (equation 4.23) but is usually only weakly dependent on 
temperature near room temperature. The isothermal compressibility of a solid 
is constant at low temperatures but increases slowly with temperature. At room 
temperature therefore equation 5.1 becomes, for a small change in temperature 
AT 


A P = ok t - 1 AT (5.3) 

The values a = 50 x 10 -6 K -1 , n T ~ l = 14 x 10 10 N m -2 for copper at room 
temperature are typical values for solids. Then a 1 K rise in temperature requires 
the pressure to change by 7 x 10 6 N m“ 2 (roughly 70 atmospheres) to hold the 
system at constant volume. The system would therefore have to be enclosed 
within a high pressure press if direct measurements were to be made at constant 
volume but by using the laws of thermodynamics a general thermodynamic 
result will now be obtained relating measurements at constant pressure to those 
at constant volume. 

Suppose that some equilibrium property of a system <t> has been measured as 
a function of temperature at some pressure P 0 . The function 4>(r,/o) is there¬ 
fore known and the function <£(r, V) may be related to it by the usual expres¬ 
sions 


d4> 



dV 


a<A _ //3$\ / a 

dTJv WAW> 


(5.4) 


Then 
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The second term on each side of equation 5.5 will cancel if (P 0 , V 0 , T 0 ) defines 
a state of the system. The evaluation of the integral requires the temperature 
dependence of the isothermal compressibility (usually small for a solid), the 
coefficient of expansion and the pressure dependence of the quantity to be 
evaluated. It is not therefore necessary to hold the volume of the system con¬ 
stant but the pressure apparatus must achieve a sufficiently high pressure for 
(3 &/dP) T and ( dV/dP) T to be evaluated. The equation simplifies considerably 
when only the volume in the integral is found to be a function of temperature 

HVo,T) = HPo,T)~ (0) t k t ~ 1 ln ( 7 ) (5 6) 

The correction to constant volume at a given temperature therefore requires a 
minimum of four measurements, for two of which high-pressure apparatus is 
required. The important point is that these quantities can all be evaluated 
separately, perhaps in different laboratories, and then the final correction made. 


5.2 THE PRINCIPAL SPECIFIC HEATS 

The relationship between the specific heats at constant pressure and constant 
volume is of particular importance since the specific heat is the most easily 
measureable thermal quantity. The required equation has already been estab¬ 
lished (equations 2.34 and 3.14) and can also be obtained directly from equation 
5.4 by taking $ to be the entropy. Then 



using the definitions of the specific heats and the Maxwell relation (equation 
4.12). The right-hand side of equation 5.7 may be rewritten using the standard 
relationship 



(5.8) 
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Since (bP/bV) T is never positive for a system in thermal equilibrium (see 
section 2.2) and ( dV/dT) P cannot be negative because of the squared term, 
equation 5.10 establishes the important result that C P > Cy. The specific heat 
at constant pressure is equal to that at constant volume if ( dV/dT) P is zero, as 
occurs for water near 4 °C, and (C P - C v ) also tends to zero at low temperatures 
since the coefficient of thermal expansion (a) is zero at T = 0 K but the iso¬ 
thermal compressibility (k t ) is not. In fact ( C P — C v ) goes to zero more rapidly 
than either C P or Cy separately since (Cy/a) is found to be a constant at low 
temperatures. The specific heat of an insulator, for example is proportional to 
T 3 at low temperatures (see section 10.3) so (C P - Cy) ^ T 7 . 

The behaviour of the specific heat at the critical point is also of importance. 
At the critical point 



(section 2.2) so the isothermal compressibility goes to infinity. Since the right- 
hand side of equation 5.7 may be rewritten using equation 5.8 


C P 



(5.11) 


(Q> - Cy) goes to infinity at the critical point. Experiments suggest that the 
specific heat at the critical volume also goes to infinity as the temperature 
approaches T c but the divergence is much weaker for C v than for C P (section 
10.4). 

The high isothermal compressibility near the critical point leads to large 
fluctuations in the density of the system over regions large enough to cause the 
scattering of a beam of light passing through the system, a phenomenon known 
as critical opalescence. 

The correction of the measured specific heat (Cp) as a function of temperature 
to the specific heat at constant volume using equation 5.10 requires the complete 
temperature dependence of the volume per unit mass, the coefficient of thermal 
expansion and the isothermal compressibility. This large amount of experimental 
information is rarely available but fortunately the quantity 


A = 


VOL 2 

KpCp 2 
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is found to be nearly independent of temperature (Zemansky, 1968). The equa¬ 
tion then becomes 


C P - C v = AC P 2 T (5.12) 

The values of V, a and Kp for the system are therefore now required at only one 
temperature. Equation 5.12 is called the Nernst - Lindemann equation (see 
exercise 5.1). 

A discussion of the specific heat of solids as a function of temperature using 
statistical mechanics will be found in section 10.3. The essential features are that 
both Cp and Cy go to zero at absolute zero and that Cy tends to a constant 
value at sufficiently high temperature. The specific heat at constant pressure 
however does not become constant at high temperature, as may be seen from 
equation 5.12. 


5.3 COOLING AND LIQUEFACTION OF GASES 

The liquefaction of the so-called permanent gases such as air, nitrogen, helium, 
is one of the most important applications of thermodynamics both for industrial 
uses and for low-temperature research. The essential feature of the gas - liquid 
transition is that a gas cannot be liquefied by increasing the pressure unless the 
temperature is below the critical temperature (section 2.2). The simplest method 
of liquefying a gas is therefore to place it in contact with a reservoir at a 
sufficiently low temperature. For example a Stirling engine refrigerator (section 
3 .7) using hydrogen or helium as a working substance can run at a sufficiently 
low temperature (75 K) for air to liquefy at atmospheric pressure. 

Alternatively a gas may cool after a free expansion, an adiabatic expansion, 
or a throttling process, although only the reversible adiabatic expansion invari¬ 
ably leads to a decrease in temperature as will now be shown. The free expansion 
of a gas was discussed in section 3.3. The internal energy of the gas is constant 
in this process 


d U 


d U\ 

— | dV + 
dV/p 


w 

bT 


d T 
v 


0 


Now from equations 2.32 and 3.14 
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Therefore 


C v dT + 


wa -'] 


dV = 0 


c y B [r(^) -f' 

7(B) - 7(A) = - | f B _ d y (s . 13) 

A L cy 


A free expansion always leads to a cooling of a van der Waals gas (exercise 5.2) 
but occurs without change of temperature for a perfect gas as has already been 
seen in section 3.3 since ( bU/dV) T is zero for a perfect gas. 

A reversible adiabatic expansion of any gas always leads to a decrease in 
temperature. Since the process occurs at constant entropy the important para¬ 
meter is ( bT/dV ) s . Using the Maxwell-relation equation (4.15) 




Cv \w)t \dT)p 


(5.14) 


Since ( dP/dV) T < 0 for equilibrium and (dV/d T)p > 0 for a gas, (dT/dV) s < 0. 
A gas always cools in a reversible adiabatic expansion and the cooling is greater 
than for a given volume change in a free expansion as may be seen by comparing 
equations 5.13 and 5.14. 

The temperature of a gas may therefore be reduced by using it as the working 
substance in a suitable engine provided that the operation is sufficiently close to 
that of a reversible engine. However at low temperature friction is always a 
serious problem because normal engine lubricants cannot be used and only gas 
lubrication of the engine pistons is possible. The liquefaction of helium is there¬ 
fore carried out in the Collins liquefier by reducing the temperature to 10 K in 
an expansion engine and then achieving the final temperature drop to 4 K in the 
throttling process which will now be described. 

The principle of a throttling process is illustrated in figure 5.1. A gas at 
pressure P A is initially contained within an adiabatic enclosure, one wall of 
which is formed by a piston behind a porous plug. The two pistons are now 
moved so that the pressure on the left of the plug is always P A and on the right 
of the plug P B as shown in the figure. The final equilibrium state is (P B , F B ). The 
passage of the gas through the porous plug is an irreversible process but thermo- 
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Figure 5.1 In a throttling process the gas is thermally isolated and the pistons moved so that 
the pressure is always P A on the left-hand side of the porous plug and P B on the right- 
hand side. The gas passes through non - equilibrium states in the porous plug but thermo¬ 
dynamics may be applied to the initial and final equilibrium states. 


dynamics may be used to find the relationship between the initial and final 
equilibrium states. 

Since the whole system is thermally isolated, Q is zero and the change in the 
internal energy of the gas is equal to the net work done on the gas. That is 

fO C V B 

U(B) - U( A) = - J ^ P a dV - ) q P b dV 
= PaV a - P b V b 

since both changes take place at constant pressure. Therefore 

U{ B) + P b ^b = U{ A) + P A V A (5.15) 

Since the enthalpy ( H ) is defined as U + PV (table 4.1) equation 5.15 is equi¬ 
valent to the statement that the initial and final states of a gas in a throttling 
process are states of equal enthalpy. The intermediate states which the gas 
passes through in the porous plug are non-equilibrium states to which 
thermo-dynamics cannot be applied. The entropy of the gas therefore increases 
in a throttling process. Consider unit mass of gas passing through the plug. Then 

T ds = du + P du 

where the lower-case symbol is used to indicate unit mass. Now 


dh = du + P du + v dP 
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Therefore 


Tds = dh - vdP (5.16) 

The initial and final states of the gas have the same enthalpy, so dh is zero but 
d P is negative since the pressure decreases. The entropy of unit mass of gas there¬ 
fore increases in a throttling process by an amount 

.(B) - .(A) - - ^ (5.17) 

The integral is as usual evaluated for any reversible path joining and Pa and 
may be calculated if the equation of state of the gas is known. A perfect gas has 
the particularly simple relation 


s(B) - s(A) = - R 



d P 
P 


= flln 


'Pa 

Pb 


> 0 


(5.18) 


The temperature change between initial and final equilibrium states in a 
throttling process is due to a change of pressure at constant enthalpy, (dr/3P) h . 
Using equation 5.16 


a-(a- 

Therefore 

-a©-a- 

© ■ i re), -] 


The last equation follows from the identity 


(5.19) 


C P = T 



(5.20) 


(equation 4.11) and the Maxwell relation (equation 4.13). The quantity ( dT/bP% 
is called the Joule - Kelvin coefficient since they first performed the porous plug 
experiment. The temperature change after a finite pressure drop may be found 
by integrating equation 5.19. After a small pressure drop A P, the temperature 
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(5.21) 


Since Cp is positive and A P negative (pressure drop ) the gas warms if the 
bracket is negative. 

The temperature change for a perfect gas is zero but for a van der Waals gas 
(ignoring second-order terms) 



A P 


(5.22) 


At low temperature the bracket is positive and the gas cools, at a temperature 
equal to la/Rb there is no change of temperature and at higher temperature the 
gas warms. The temperature at which there is no temperature change after the 
throttling process is called the inversion temperature. The inversion temperature 
for the van der Waals gas is related to the critical temperature, using equation 
2.14, and to the Boyle temperature 

Ti=—T c = 2 Tq (5.23) 

4 

The general equation of the inversion curve is found by setting AT equal to 
zero in equation 5.21 


T 



= v 


(5.24) 


In a real gas the inversion temperature (given by equation 5.24) is a function of 
the pressure, as shown schematically in figure 5.2. The inversion curve is shown 
as a dotted line. The maximum cooling occurs for a given T A if the pressure P A 
is such that ( P A , 7a) lies on the inversion curve. There is a maximum inversion 
temperature above which no cooling is possible. The relationship between the 
maximum inversion temperature and the Boyle temperature for a van der Waals 
gas (given by equation 5.23) is also correct to within 10 per cent for real gases. 

The change of temperature of a real gas in a Joule - Kelvin expansion depends 
upon the sum of the differences from Boyle’s law and Joule’s law for a perfect 
gas (exercise 5.3). The inversion curve is therefore a stringent test of an equation 
of state for a real gas. An equation of the form PV = RT + BpP + CpP 2 is inade¬ 
quate (exercise 5.5) at low temperatures because the calculated pressure on the 
inversion curve goes to infinity instead of showing a maximum as in figure 5.2. 

The most difficult gas to liquefy is helium, which has a critical temperature 
of 5 K and a maximum inversion temperature of about 50 K. The boiling point 
of liquid helium is 4.2 K at normal pressure. The problem of producing large 
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Figure 5.2 In a real gas the inversion temperature is a function of pressure. The full lines ; 
are curves of constant enthalpy, the dotted line the inversion curve. The maximum 
inversion temperature occurs when the inversion curve reaches the 7"-axis. Since the 
initial and final states of a Joule - Kelvin expansion lie on a curve of constant enthalpy 
it can be seen that the maximum cooling occurs if the initial state lies on the inversion 
curve. 

quantities of liquid helium was first solved by Collins. The pure helium gas is 
compressed to a pressure of 17 atmospheres at a temperature of 77 K using 
liquid nitrogen as a coolant. The liquid nitrogen may be obtained from air 
liquefied using a Stirling engine refrigerator, as briefly discussed in section 3.7. 
The gas is then cooled to 10 K using two expansion engines and delivered to the 
Joule - Kelvin valve at 17 atmospheres pressure. The helium then cools to 4.2 K 
at a pressure of 1 atmosphere. The efficiency of the liquefler is increased by 
using the cold helium gas to cool the incoming gas — a counter-current heat 
exchanger. The advantage of the Joule-Kelvin expansion is that it requires no 
moving parts. The temperature drop for a given pressure drop is less than that 
for a reversible adiabatic expansion (exercise 5.6), but the problem of lubricating 
the moving parts of an expansion engine at low temperature is avoided. 

Low-temperature physics only became possible with the development of 
helium liquefiers. Many experiments are simply carried out with the sample 
immersed in liquid helium at 4.2 K, but by reducing the pressure above the 
liquid using a large vacuum pump the temperature can be reduced to about 
0.6 K. At lower temperature normal helium ( 4 He) becomes a superfluid and 
faster pumping does not lower the temperature further because of the very large 
thermal conductivity of 4 He in this region. The rare isotope of helium, 3 He 
remains a normal fluid to much lower temperature (sections 10.1 and 10.4) 
and may be used to reduce the temperature to about 0.3 K. A refrigerator using 
mixtures of 3 He and 4 He may also be used down to temperatures of about 
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0.01 K (10 mK). Methods of reaching still lower temperatures are discussed in 
sections 10.4 and 10.6. 


EXERCISES 

5.1 Find the constants in equation 5.12 for copper at 300 K using the follow¬ 
ing data: Molar volume 7.06 x 10“ 6 m 3 . C P = 24.5 J K -1 mol -1 

a = 4.92 x lO-^K' 1 k t = 7.76 x 10" 12 N _1 m 2 

At 800 K C P = 28.0 J K -1 . Find the value of C v at 300 K and 800 K. 

5.2 Find the change in temperature of 1 mole of a van-der Waals gas after a free 
expansion from 10 -3 m 3 to 2 x 10 -3 m 3 . Take a = 0.14 N m 4 mol -2 . 

5.3 Show that equation 5.21 for the change of temperature in a Joule - Kelvin 
expansion is equivalent to 



The temperature change of a gas in the Joule - Kelvin expansion is due to the 
sum of the deviations from the two perfect gas laws. 

5.4 Derive equation 5.22 and hence find the change of temperature of a van der 
Waals gas at the Boyle temperature after a Joule - Kelvin expansion. 

5.5 The virial equation may be written: 

PV = RT + B P P + C P P 2 + D P P 3 + ... 

Find the equation of the inversion curve and show that an equation involving 
only Bp and Cp is inadequate at low temperatures. (The results of exercise 2.2 
relating Bp and Cp to the van der Waals constants may be used.) 

5.6 Show that for a given pressure change the cooling in a Joule - Kelvin expan¬ 
sion is always less than that due to a reversible adiabatic expansion. 



Conclusion to Part I 


Thermodynamics has been seen in the last two chapters to be of importance 
both for pure science and for its applications. A language has been constructed 
which can describe the equilibrium states of any system and can also predict the 
observed transitions between the various states which satisfy the law of the con¬ 
servation of energy. The application of the second and third laws of thermo¬ 
dynamics together with the Maxwell relations and the mathematics of functions 
of two variables has led to general results for such practical problems as the 
reduction of measurements to constant volume and the liquefaction of gases. 
Further examples of the applications of thermodynamics will be given in 
chapter 10, after a microscopic view has been developed in part II. 

Thermodynamics can be extended greatly beyond the range of problems that 
have been considered so far in this book. The extension to systems of more than 
two variables which has only been briefly considered in this book is of funda¬ 
mental importance in chemistry where the number of molecules of a given type 
in the system must be treated as a variable. Similarly the thermodynamics of 
small systems where the ratio of surface to volume properties is not negligible, 
or very large systems where gravitational effects cannot be neglected, are of 
great importance in biology and astronomy respectively. 

Thermodynamics may also be extended (if certain further assumptions are 
introduced) to systems which are not in thermal equilibrium. The flow of heat 
across a finite temperature gradient for example has been seen to be an irrever¬ 
sible process. The theory of the transport properties (such as thermal conduc¬ 
tivity) of a system is therefore sometimes called the thermodynamics of irrever¬ 
sible processes and has been an active branch of theoretical physics for many 
years. Transport theory is considered from a simpler viewpoint in chapter 9 but 
the thermodynamic theory will not be considered in this book. 

The limitation of thermodynamics that always remains however, is that the 
fundamental equation from which all the equilibrium properties of the system 
can be calculated has to be obtained from outside thermodynamics. The funda¬ 
mental equation for a given system could be established by an enormous amount 
of experimental work. This approach may sometimes be necessary in an engineer¬ 
ing problem, but is unsatisfactory in science. What is required is an atomic 
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theory for the system of interest from which the fundamental equation can be 
deduced. All the equilibrium properties of the system can then be evaluated 
using thermodynamics alone. 

The microscopic (atomic) theory of the equilibrium properties of a system 
(called statistical mechanics) will be developed in the next section. Statistical 
mechanics also provides new insights into thermal processes. For example the 
entropy of a system, although well defined in terms of thermodynamics, may 
have seemed a rather mysterious quantity. What are the consequences at the 
atomic level of an increase in the entropy of a system? 

Thermodynamics can provide a clue to the answer to this question but the 
complete answer requires the atomic viewpoint of statistical mechanics. As an 
example of the thermodynamic argument consider the following information 
obtained from X-ray analysis of rubber fibres: when the rubber fibre is stretched 
it is found to be in the crystalline state. The atoms therefore form a regular 
ordered array. When the tension is removed the rubber forms an amorphous 
(disordered) material. This change may be correlated with the entropy of the 
rubber fibre as a function of tension by using the Maxwell relation 

a ■( 

obtained by substituting / for V and — ^for P in equation 4.13. The left-hand 
side of the equation is found by experiment to be negative for rubber, that is to 
say the length of a fibre held at constant tension decreases as the temperature is 
raised. This is of course the opposite change to that normally found, for example 
in a steel wire. The entropy of rubber when stretched at constant temperature 
therefore decreases and the degree of atomic order (amorphous to crystalline 
solid) increases. (The statistical mechanics of rubber is discussed in section 7.4.) 

If we are prepared to generalise this observation to all systems, then the 
second law of thermodynamics in the form: ‘the entropy of a system within 
adiabatic walls cannot decrease’ might be written: ‘the degree of atomic order of 
a system within adiabatic walls cannot increase’. The system cannot spontaneously 
pass to a state of higher order. (The rubber fibre was of course not within adia¬ 
batic walls but in contact with a heat reservoir at temperature T so its entropy 
could both increase and decrease.) 

This connection between entropy and atomic order will be developed in the 
next section, but it is worth stressing that the thermodynamic entropy is a 
completely macroscopic quantity (defined to within a constant by dS = &Qr/T) 
independent of any microscopic theory. An ‘atomic’ entropy must therefore be 
shown to have the same properties as the macroscopic entropy. The connection 
between entropy and atomic order is very difficult to establish for all but the 
simplest systems, but the macroscopic entropy remains a useful parameter. 


dS \ 
d^/T 
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EQUILIBRIUM 
STATISTICAL MECHANICS 



6. Weakly Coupled Systems 


The discussion in section 2.3 showed that thermodynamic concepts such as 
temperature are not required for the discussion of the behaviour of a single 
particle in a conservative system, since the energy of the particle is constant. 

When the isolated system consists of a number of particles (such as molecules 
of a gas) however, the individual particle energies change after each collision in 
such a way that the energy of the entire system (£) is conserved. 

E = e i + e 2 + •*• + e N 

In general it is no longer possible (or indeed interesting) to discuss the exact 
behaviour of each molecule. In thermal equilibrium the macroscopic properties 
of the system become independent of time and at the molecular level it is 
postulated that the fraction of the molecules with a given range of energies is 
also independent of time. An individual molecule will leave the chosen energy 
range after a collision but elsewhere in the gas another molecule will on average 
also make a collision such that it enters the energy range. 

The concept of a distribution function to describe the equilibrium properties 
of a classical perfect gas was first introduced by Maxwell and was later extended 
by Boltzmann. The distribution function ^ (u, r ) is defined such that: f 0 ( v , r) d 3 wd 3 
is the number of molecules of the gas in the velocity range v to v + du and with 
position between r and r + dr when the gas is in the thermal equilibrium. 

Velocity is a vector quantity involving both magnitude (speed) and direction. 

In many books however no distinction is made in the text between speed and 
velocity and it is then necessary to consider whether the vector or scalar quantity 
is under consideration. The symbols d 3 u, d 3 r are used to mean a three-dimen¬ 
sional volume element such as dv x dv y dv z in cartesian coordinates or v 2 du 
sin 6 d 6 d0 in polar coordinates as convenient. 

The general distribution function can often be simplified. When gravitational 
effects are neglected the energy of a molecule of a perfect gas is independent of 
position. The integral of ^ (i u , r) over all velocities and over the allowed volume 
of the gas (which is simply equal to N , the total number of molecules) is then 
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J J fo (v, r ) d 3 »d 3 r = J / 0 (») d 3 » J d 3 r = N 

Therefore 

r ^ (») d 3 »= ~ 

J V 


The correct form for the velocity distribution function / 0 (u) was first given by 
Maxwell by an ingenious but inadequate argument (exercise 6.1) and first derived 
rigorously by Boltzmann for the case of binary collisions of gas molecules 


fo ( v ) d 3 t> = 


N 

V 



d 3 u 


( 6 . 1 ) 


This equation will be derived as a special case of the general results of statistical 
mechanics in section 8.1. In equation 6.1, m is the mass of the molecule, k is the 
Boltzmann constant and T the absolute temperature. 

An alternative interpretation of equation 6.1 is that the probability of a given 
molecule having velocity in the range u to u + do is given by 


rnv 2 

2 e 2kT d 3 w (6.2) 

The transition between the properties of a single molecule (which can only be 
described statistically by equation 6.2) and the whole system of N particles is 
only possible because in a perfect gas the molecules are on average well separated 
and weakly interacting . This separation would not be possible in a liquid, where 
the properties of the system as a whole must always be considered. 

The fundamental concept of thermodynamics was found to be the entropy 
function. The fundamental concept of classical statistical mechanics is the dis¬ 
tribution function. The relationship between these two quantities was first 
established by Boltzmann who defined a function H such that 

" - jv (u,r, t) In f(y,r, t) d 3 v d 3 r (6.3) 

Boltzmann showed that as a result of binary collisions in the gas an arbitrary 
function f(v , r, t) always changed in time until it came into the Maxwell form 
given by equation 6.1. The function H always decreased with time to a minimum 
value which was associated with thermal equilibrium. A connection is therefore 
suggested with the (negative) of the entropy of the system since the entropy 
cannot decrease for a system within adiabatic walls. The form 


S B = - kVH 0 


(6.4) 
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where H 0 is the function of equation 6.4 with f 0 (u, r) given by equation 6.1 may 
be shown directly (exercise 6.2) to give the entropy of a perfect gas in agreement 
with thermodynamics. 

A number of objections were advanced against the H theorem and it was 
subsequently modified by Boltzmann and others (see Tolman, 1938 for 
example). Consider, for example, that d 3 v represents the limit of some small 
volume A 3 u. The number of particles in A 3 u may be calculated as a function 
of time by computer experiments (section 1.2) and is found to fluctuate even 
for times greater than the relaxation time of the system. The value of H there¬ 
fore also fluctuates, as shown in figure 1.4. The question of fluctuations is 
considered later (section 7.1) but it is sufficient for the moment to state that 
since the importance of fluctuations in the classical region decreases as 
the inverse square root of the number of molecules, they will normally be 
unimportant for macroscopic systems containing perhaps 10 20 particles. 

The classical Boltzmann entropy function was extended to include quantised 
systems by Planck. The approach via quantum mechanics is in fact simpler than 
the earlier classical argument and will be used in this book. A perfect gas is one 
example of a weakly interacting system of particles. These systems are of great 
importance because of their relative simplicity and will be defined and discussed 
in the next two sections. The Boltzmann - Planck entropy will then be calculated 
for a number of systems and will be shown to lead to the fundamental equation 
(in the sense of chapter 4) from which all the thermodynamic properties of the 
system may be calculated. 

6.1 SYSTEMS OF IDENTICAL PARTICLES 

The present section is intended to form a very brief revision of some of the 
concepts of quantum mechanics, particularly as applied to systems of identical 
particles. The reader who is unfamiliar with these concepts may still follow the 
argument of section 6.3 and 6.4 but would need to consult a text on quantum 
mechanics before reading section 6.5. 

According to quantum mechanics, the Schrodinger wave equation of a system 
contains all the information available about that system. This information may 
however be less than would be expected from the classical physics of macro¬ 
scopic bodies. For example the two electrons in the helium atom are considered 
to be indistinguishable by experiment, whereas the moons of a planet are 
certainly distinguishable. 

The time-independent Schrodinger equation 

Jfi// = Et 

where K is the Hamiltonian operator and E the energy, has in general both 
discrete (quantised) energy solutions and also regions in which the energy is 
continuous. The electronic energy levels of the hydrogen atom are shown in 
figure 6.1. The energy levels are quantised but come closer and closer together 
at higher energies. 
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Figure 6.1 The lower allowed electronic energy levels of the hydrogen atom. The degeneracy 
of each level is shown at the right. The degeneracy is equal to In 2 where n is the principle 
quantum number. 


A solution of the wave equation is called a state of the system, or a microstate 
if it is thought necessary to distinguish it from the thermodynamic state (macro¬ 
state) discussed in part I. In general a number (g ) of solutions of the wave equa¬ 
tion have the same energy (E n ). The energy level n is then said to be g--fold 
degenerate. In many cases it will be seen that it is not necessary to consider the 
effect of very small splittings of the energy levels. If the separation between a 
number of levels is much less than the thermal energy (kT) they may be grouped 
and treated as one degenerate level. 

The wave function of a free particle in space has the form 

^ = Ae lk ' r 

(where A is a normalising constant, k = 27r/X and X is the de Broglie wavelength 
of the particle) and the wave equation has only a continuous energy solution. 
However if the particle is considered to be confined to a box of volume V the 
energy levels become quantised, although for a macroscopic box (say of side 
0.01 m) the levels are extremely close together (appendix IV and exercise 6.3) 
relative to the thermal energy kT. The allowed energy states may then be 
treated as a continuous function (the density of states) such that 

D(e) de = number of allowed states in the 
energy range e to e + de 

The density of states function for the molecules of a gas in fact provides the 
link between the quantum and classical theories of perfect gases. 
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When the system of interest consists of more than one identical particle, the 
wave functions of the individual particles will in general overlap and the particles 
will interact so strongly that the allowed energy levels of the whole system will 
become quite different from the energy levels in the individual particles. The 
energy levels of the electrons in a crystalline solid are shown schematically in 
figure 6.2. When the atoms are widely separated (as in a gas) each atom has the 
same set of electronic energy states. As the distance between the atoms decreases 
the sharp atomic energy levels spread out into bands of allowed energy separated 
by regions (gaps) in which no energy states exist. The width of a band depends 
upon the degree of overlap of the atomic wave functions. In general then, there 
is no simple relationship between the wave function of the whole (strongly 
interacting) system and that of the separated atoms. 



Figure 6.2 The energy levels of the isolated atom (right) are broadened into bands when the 
wave functions overlap as in a crystalline solid. A typical lattice spacing (a) is shown. 


It is however possible to say something about the behaviour of the wave 
function of any system if two identical particles with overlapping wave functions 
are interchanged. Consider for simplicity a system of just two identical particles 
in a non-degenerate state. Then if the coordinates of the two particles are 
written A and B respectively, where the coordinates are the three spacial dimen¬ 
sions plus the spin of the particle 

*(A,B) = ± (KB,A) (6.5) 

When the sign is positive the wave function is called symmetrical, when negative 
antisymmetrical. 
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The restriction imposed on the wave function by equation 6.1 arises because 
it is not possible to distinguish between the two particles. The probability that 
one particle is in a volume dV A at A and the other in a volume dV B at B is given 
by B)P dV A dF B . Since it is not possible to say which of the particles is 
at A and which at B the interchange of the two particles does not lead to a new 
state of the system and li//(A, B)l 2 is equal to l\[/(B, A) I 2 . 

It may be shown that a system in a state described by a wave function of one 
symmetry can never pass to a state described by a wave function of the opposite 
symmetry. 

The fundamental particles are therefore divided into two groups which may 
be shown to be characterised by the spin of the particle. Particles with integral 
or zero spin (for example photons, 4 He) have symmetrical wave functions. They 
obey Bose - Einstein statistics (section 6.5) and are often called bosons. 

Particles with half-integral spin (for example electrons, protons, neutrons, 

3 He) have antisymmetrical wave functions. They obey Fermi-Dirac statistics 
(section 6.5) and are often called fermions. 

The Pauli exclusion principle arises directly from the requirement that the 
wave function of a system of particles with half-integral spin be antisymmetric. 
The wave function is found to vanish if two particles are in the same single-particle 
quantum state (including spin). A statement of the principle is: only one fermion 
can be placed in each single-particle state of the system. 

If the spin states are considered separately then (2s + 1) fermions can be 
found in each energy state. Therefore a maximum of two electrons (s =\) 
can be found in each energy state when the state is defined without spin. 

In general the Pauli exclusion principle requires that the properties of a gas 
of fermions be quite different from those of the classical Maxwell gas (as might 
be expected) but it is also found that a boson gas may be quite different from 
both the classical and fermion gases. It might therefore appear that two or three 
perfect-gas laws are required, but we shall see that at the densities and tempera¬ 
tures at which gases exist the quantum corrections to the classical equation of 
state ( PV = RT ) are usually negligible (section 7.5). 


6.2 WEAKLY COUPLED SYSTEMS 

When N identical particles are brought together to form a macroscopic system 
they will in general interact strongly (as discussed in the last section). A parti¬ 
cularly simple situation arises however where the interaction of the particles is 
so weak that the allowed energy levels of each particle are unchanged. This 
situation could arise in two rather different ways. 

A localised system of particles could be established with a sufficiently large 
separation between the particles for the overlap of their wave functions to be 
neglected. The interaction must be just sufficient for the system as a whole to 
come to a common temperature. In this case each particle interacts indepen¬ 
dently with an external stimulus (say a magnetic field) and the Af-body problem 
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reduces to a single-body problem. The localised, weakly interacting particles 
must be treated as distinguishable since the sites of the particles may be labelled. 

The separation of the atoms in a solid is not sufficient for the whole system 
to be treated as a collection of independent particles but it will be seen that 
certain aspects of the solid state can be treated (either exactly or to a first 
approximation) within the weakly coupled model. A nucleus with spin-|, for 
example, has a very small magnetic moment whose interaction with a neighbour¬ 
ing nucleus may be neglected relative to the interaction with a magnetic field. 

The nuclear magnetism may therefore be treated as the interaction of N weakly 
coupled nuclei with the magnetic field. As a further example, the lattice 
dynamics of a solid can be treated, to a rough approximation, as equivalent to a 
set of N independent simple harmonic oscillators (the Einstein model, section 
10.3). 

When the wave functions of a set of non-localised particles are allowed to 
overlap the particles must be treated as indistinguishable. When the energy of 
interaction between the particles is small the wave function of the whole system 
may be considered to be formed from the products of the wave functions of the 
individual particles, and can only be symmetrical or antisymmetrical depending 
upon the spin of the particles, as discussed in section 6.1. The particles are then 
distributed over the allowed single-particle energy states, allowing for the 
exclusion principle if the particles are fermions. The single-particle energy states 
must be found using quantum mechanics. 

The present section may be summarised as follows. The wave function of a 
macroscopic system is a function of all the particles in the system. This is an 
A^-body problem to which there is in general no exact solution but in a weakly 
coupled system the only information required is 

(a) The allowed energy states of a single particle. 

(b) Whether or not the particles are localised. Localised particles will be 
treated as distinguishable, non-localised (identical) particles as indistinguishable. 

(c) If the particles are non-localised they must be classified as either bosons 
or fermions. The restriction that no two fermions can be in the same single-particle 
state must be observed. 

(d) The macroscopic state of the system (for example constant energy and 
volume) must be specified. 

The methods of statistical mechanics will now be applied to a particularly 
simple localised system (in section 6.3), and to a gas (in section 6.5). The general 
results of statistical mechanics will be discussed in chapter 7. 


6.3 TWO MODEL SYSTEMS 


The simplest quantised system to discuss is an assembly of N identical weakly 
interacting localised particles, each of which can only exist in one of two 
non-degenerate energy levels separated by an energy e [figure 6.3(a) and (b)]. 
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(b) 
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Figure 6.3 (a) A two - level system. A particle can exist only in the state of zero energy or 
with energy e. (b) The energy of the system (the macrostate E = 2e) is specified by the 
occupation numbers, (c) The microstate of the system requires the specification of the 
energy level occupied by each particle. To each macrostate there corresponds a finite 
number of microstates. 


The application of this model to a real system is discussed in chapters 7 and 10. 

Since the particles are localised they are to be treated as distinguishable and 
the interchange of two particles between the two energy levels leads to a new 
microstate of the system, although the energy of the system is unchanged. The 
energy of the whole (isolated) system ( E ) and the number of particles ( TV ) in 
volume ( V ) defines the macrostate of the system. The distinguishable microstate 
for a given (TV, E , V ) can then be enumerated. 

The microstates for TV = 3 and E = 2 are shown in figure 6.3(c). Notice that 
the macrostate is defined by just the number of particles in each energy level 
(the occupation numbers) but the microstate requires the specification of the 
particular particles in each level because the localised particles must be treated 
as distinguishable. The number of microstates for a given macrostate therefore 
depends strongly upon the distinguishability of the particles. 

In order to go beyond the representation of the microstates shown in figure 
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6.3 it is essential to make some assumption about the probability with which 
each microstate occurs. The fundamental assumption of statistical mechanics 
(the assumption of a priori probabilities) is: ‘a closed system (that is with 
N, E, V constant) in thermal equilibrium is equally likely to be in any of the 
microstates accessible to it’. A microstate is said to be accessible if it satisfies the 
macroscopic conditions imposed on the system (such as N = 3, E = 2 in figure 
6.3). The microstate refers to the system as a whole. In the case of weakly 
coupled systems the accessible states of the whole system may be constructed 
using both the macroscopic conditions and any restriction on the filling of the 
single-particle states (section 6.4). 

The justification of the assumption of a priori probabilities must ultimately 
rest upon the agreement between the calculations of statistical mechanics and 
experiment. It is in fact the simplest assumption that can be made and is some¬ 
times justified in terms of our ignorance of the detailed behaviour of atomic 
systems. 

If each of the microstates of the model system is treated as having equal 
probability, an equation for the number of distinguishable microstates for a given 
macrostate may be found immediately. The energy of the system can only take 
on the value Me where Mis the number of particles in the upper energy level. 
There are then (N - M) particles in the lower level. The total number of per¬ 
mutations of N particles is N\ (TV! = 1x2x3x .. .x (N — l)x N) but the 
order in which the identifiable particles go into the energy levels does not 
change the microstate. The total number of microstates [W{E)] for energy 
Me is therefore 


_ N\ 

W(E) =- 

M\(N-M)\ 


( 6 . 6 ) 


(E = Me) 

There is nothing more to say about the microstates of the system so a connec¬ 
tion between W(E) and the entropy of the system must be sought. The first 
statement of this connection for a quantised system was given by Planck, 
following the classical statistical mechanics of Boltzmann (equation 6.4) 

S B = k\nW(E) (6.7) 

where k is the Boltzmann constant. The form S B is used to stress that it must 
still be shown that this function is equivalent to the thermodynamic entropy 
(S'). The Boltzmann - Planck entropy is correctly extensive since if two identical 
systems Si and S 2 are joined together 

S = Si + S 2 (6.8) 

but the number of states accessible to the new system is 

W = Wi W 2 


(6.9) 
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The form of equation 6.7 satisfies equations 6.8 and 6.9. As was remarked at the 
beginning of this section however, the Boltzmann - Planck entropy allows the 
entropy of an isolated system to fluctuate and therefore decrease from its 
equilibrium value (although the fluctuations are normally negligible). This point 
will be discussed further in chapter 7. 

The fundamental equation of the model system has therefore been obtained 
(if it is agreed to accept the entropy defined by equation 6.7 as equivalent to the 
thermodynamic entropy). 

S B = k(lnN\ - In Ml - In (N-M)\) (6.10) 

An important simplification of this equation is obtained if it is remembered that 
N is normally very large (« 10 24 particles) and that for large numbers 

In iV! « NlnN - N (Stirling’s approximation) (6.11) 

If M and (M - N) are also taken as being very large, the equation becomes 

S = k[NlnN - N - Min AT + M - (N-M)\n(N-M) + N-M] 

= k[NlnN - MlnM - (N- M) In (N-M)] ( 6 . 12 ) 

The temperature of a macroscopic system is defined (from the second law of 
thermodynamics) 


TdS = dE + PdV 

1 = = I 

T \dE)v 6 \dM/v 


(6.13) 


where the total energy (E) has been taken as identical to the thermodynamic 
internal energy. Applying equations 6.12 and 6.13 


or 


Therefore 


— [ln(/V-M) - InM] 

T e 

€ 

M = ( 


M = 
E = 


N 

1 + e f / fcT 
Ne 


(6.14) 


(6.15) 


1 + e 6 /* 7. 


(6.16) 
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The entropy of the system at temperature T may be found by substituting for 
M and (N - M) in equation 6.12. 

Notice that as the temperature goes to absolute zero the energy and the 
entropy both go to zero. The model system is therefore in agreement with the 
third law of thermodynamics, which in terms of equation 6.7 may be reinter¬ 
preted: ‘there is only one allowed microstate of a system at absolute zero.’ It is 
widely believed (although there is in fact no proof) that the true ground state of 
any system is non-degenerate. In fact so long as the ground state is only £-fold 
degenerate (rather thang^-fold) the contribution to the entropy would be 
vanishingly small due to the fact that the entropy depends upon In W(E) (see 
equation 6.20). 

There are three other features of the model calculation which should be 
noted because of their general importance. The original system was defined to 
be an isolated system of constant energy. An expression was then found for the 
number of microstates which satisfied the specified energy ( E ) and the entropy 
of the system was defined by equation 6.10. The temperature however could 
only be introduced once the assumption had been made that the number of 
particles in the system was large. ‘A large isolated system acts as its own thermal 
reservoir and a temperature may be defined for the system.’ 

The second point to notice is that the calculation using Stirling’s approxi¬ 
mation, while justifiable for large N, also required M and (N - M) to be large. 

At low temperatures AT is not large and it might therefore be thought that 
equation 6.15 was only approximately correct. This is not in fact the case, as 
will be seen later, if M is taken to be the thermal average number of particles in 
the upper state at temperature T. That is to say the temperature is treated as the 
fundamental quantity, rather than E. The energy defined by equation 6.16 is 
then the average energy at temperature T and fluctuations about the mean 
energy are now allowed (section 7.1). 

The third point is that expressions like ‘high temperature’ and ‘low tempera¬ 
ture’ have a definite meaning for quantised systems. The important parameter is 
not the temperature as such but the ratio e/kT , as may be seen from equation 
6.15 or 6.16. A characteristic temperature T g may be defined in terms of the 
separation of the allowed energy levels. 

e = kT g (6.17) 

At high temperature T>T g and from equation 6.15 M = N/2. It is left to the 
reader to show that in this high temperature limit 


S = Aft In 2 (6.18) 

The specific heat associated with a two-level system therefore goes to zero at 
both high temperature and low temperature (where S goes to zero) since 


C v = T 


dS\ 

dTjv 
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Figure 6.4 (a) The allowed energy levels of a simple harmonic oscillator extend without 
limit to higher energies; The zero - point energy is shown as a broken line, (b) The 
allowed microstates for N = 3, E = 4e. 


The two-level system is rather unusual in that most quantised systems do not 
have an energy level beyond which the energy cannot rise. One of the conse¬ 
quences of this maximum in the energy is discussed in section 10.5. 

A simple harmonic oscillator has a set of energy levels given by e = (n +\)e 0 
where n is an integer, greater than or equal to zero and e 0 = hv where v is the 
frequency of the oscillator and h the Planck constant. The levels therefore 
extend to higher energies without limit. The allowed microstate for a set of N 
weakly interacting simple harmonic oscillators with total energy E may be found 
by exactly the same method as for the two-level system although the results are 
rather more complicated. It is convenient to take the energy zero at e 0 l 2. The 
allowed energy levels are then simply a ladder of spacing e 0 [figure 6.4(a)]. 
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(c) 

Figure 6.4 (c) The occupation numbers for N = 5, E = 4e and the number of times they occur. 


The allowed microstates for three particles and a total energy of 4e 0 are 
shown in figure 6.4(b). The occupation numbers of the energy levels are seen 
to occur in three or six ways. When the number of particles is increased to five 
with total energy 4e 0 a very important feature (characteristic of large systems) 
begins to appear [figure 6.4(c)]. One particular set of occupation numbers 
(2,2,1,0,0) occurs more frequently than any other set and the distribution of the 
occupation numbers over the energy levels in this most probable set is also close 
to that in the next most probable (3,1,0,1,0). For the simple harmonic oscillator 
it is still possible to write an exact expression for the number of microstates 
(exercise 6.4) and to calculate the occupation of the levels at temperature T as 
was done for the two-level system. In general however this exact calculation is 
not possible, as will be seen in the next section and for the case of a gas in 
section 6.5. 

The solution to this problem is to note that as the number of particles 
increases the most probable distribution of occupation numbers (already 
observed for five particles) becomes completely dominant. Furthermore since 
even for five particles the probability of occupation of an energy level within the 
most probable arrangement is close to the probability calculated over all the 
microstates (exercise 6.5) it will be sufficient for a large system to calculate only 
the distribution within the most probable arrangement. The entropy is then 
defined by the approximate expression 


S = k In Wmp (6.19) 

where Wmp means the number of microstates in the most probable arrangement. 

The distinction between equation 6.19 and the original expression for the 
entropy (equation 6.7) is unimportant for large systems because S depends on 
In W. As an example consider the high-temperature limit of the two-level system; 
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then S = k In W{E) =Nk In 2. The number of microstates available to the system 
is (with TV typically 10 24 for a macroscopic system) 

c Nbl2 10 3 « 10 1024 

This is a number compared to which even numbers like 10 10 are completely 
insignificant. Once the logarithm has been taken, the error in the entropy caused 
by neglecting 10 10 states is 

3S « *ln 10 10 « 23 k (6.20) 


but the entropy is 


S « 10 2 *k (6.21) 

In the discussion of the general localised weakly coupled system in the next 
section we will therefore set out to find the distribution of particles over the 
energy levels which occurs in the maximum number of ways. This most probable 
distribution will be considered to be effectively identical to the actual distribu¬ 
tion for a large system. 


6.4 THE GENERAL WEAKLY COUPLED LOCALISED SYSTEM 

The general weakly coupled localised system may be defined as follows: the 
system consists of N distinguishable particles each of which may exist in a set 
of energy levels of energy e r with degeneracy g r . The total energy of the isolated 
system is E and both E and N remain constant. The number of particles in each 
energy level at any instant must therefore satisfy the two sum-rules 

E = n x €i + n 2 e 2 + ... = %n r e r (6.22) 

A^=« 1 +«2 + *-* = 2 n r (6.23) 

The number of microstates accessible to the system if the energy levels were 
non-degenerate would be given by a generalisation of the two-level result 
(equation 6.6) 


W(E) 


N\ 

n t ln 2 l . . .n r \ ... 


(6.24) 


When an energy level is &-fold degenerate however and contains n r distinguish¬ 
able particles, the particles can be distributed among the microstates in g r nr ways. 
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The total number of microstates is therefore 

W(E) = n r — g r n ' (6.25) 

n r \ 

where it must be remembered that the n r are restricted by the two sum-rules 
given in equations 6.22 and 6.23. 

The most probable values for n r may now be found by maximising equation 
6.25 subject to the restrictions of 6.22 and 6.23. Taking the logarithm of 
equation 6.25 and using Stirling’s approximation (equation 6.11) 

In W(E) = IniV! + 2« r ln,g r — Sin n r \ 

= N]nN - N + 'Ln r ]ng r - SOvln n r - n r ) 

r r 

= N\aN + %n r (]ng r — In n r ) 

The maximum value of In W{E) will occur for the distribution of the n r dis¬ 
tinguishable particles for which 5 In W(E) is zero. 

- 6 In W{E) = - S(ln g r - 1 - \nn r )Zn r = 0 (6.26) 

The function In W(E) is here being approximated by a continuous function. 

The variation in In W(E) must take place subject to the two conditions 


bN = SSrt, = 0 

r 

(6.27) 

bE = Se r 5« r = 0 

r 

(6.28) 

Therefore equation 6.26 becomes 


2 (In n r — \ng r )bn r = 0 

(6.29) 


This problem of a maximum subject to certain restraints may be handled by the 
method of Lagrange undetermined multipliers (appendix III). The equation 6.27 
is multiplied by some factor (a), the equation 6.28 by some factor (/3) and the 
two equations added to equation 6.29. Then 

S (In n r — \ng r + a + $e r )bn r = 0 (6.30) 

r 

The effect of the undetermined multipliers is to make the 5 n r independent. In 
this case the only solution to the equation is for the bracket in equation 6.30 to 
be zero for all r. Therefore 


n r = g r Q 


—(a + (3e r ) 


(6.31) 
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where a and fi, the undetermined multipliers, have still to be found. Equation 
6.31 provides the solution to the most probable number of particles in the 
energy level r of degeneracy g r . The most probable number of particles in one 
single-particle state is therefore simply 

n r = e - ( “ + ' 3e '- ) (6.32) 

It will be shown in section 7.5 that (3 is given by 


P = 


1 

kT 


(6.33) 


where k is, as usual, the Boltzmann constant and T the absolute temperature. 

The constant a is obviously determined at a given temperature by the condition 
that the total number of particles be N. Therefore 

= N = e~ a 2 e~* e ' (6.34) 

r r 

where the sum runs over the states of the system or equivalently 

N = e~“ 2g> e~P e r (6.35) 

where the sum is over the energy levels of the system. The complete most 
probable distribution of particles over the allowed states of the system can 
therefore be found if the allowed states are known. The results for the two-level 
system may be verified directly (exercise 6.7). 

The method used in this section to derive the most probable distribution of 
particles over the allowed energy states is extended to perfect gases in the next 
section, but it is perhaps already obvious that its application is limited to weakly 
coupled systems. In chapter 7 the generalisation of statistical mechanics to all 
systems in thermal equilibrium will be discussed and the approximate result for 
the most probable number of particles in a state (equation 6.32) will be seen to 
be the exact result for the thermal average number of particles in the state. 


6.5 A GAS OF WEAKLY COUPLED PARTICLES 

The distribution of the velocities of a perfect gas given in equation 6.1 was 
derived by Maxwell using the methods of classical physics. The form of the dis¬ 
tribution function is correct but is really the limiting case of the quantum- 
mechanical distributions which will now be considered. In section 6.1 it was 
observed that there are two types of particles. Those with integral or zero spin 
have symmetrical wave functions and will be called bosons. In a gas the particles 
are to be treated as indistinguishable but there is no restriction on the number of 
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bosons in a single-particle state. The particles with half-integer spin (fermions) 
are also indistinguishable but their antisymmetric wave functions lead to a 
maximum of one particle per single-particle state (the Pauli exclusion principle). 
The counting of distinguishable microstates for a given macrostate is therefore 
different from the results for localised particles discussed in section 6.3 and 6.4. 

The allowed energy states of a particle in a box are derived in appendix IV. 

The states are extremely close together for a box of macroscopic dimensions and 
may be treated in terms of a continuous function (the density of states) defined 
by equation A4.31. The counting of the microstates for a given macrostate is 
therefore performed by considering the number of energy states in some small 
range of energy Ae r to be 

gr = D(e) Ae r (6.36) 

The range Ae r must be large enough to make g r large [since it will be necessary 
to use Stirling’s approximation (equation 6.11)] but small enough not to affect 
the calculation of macroscopic quantities. However since the example given by 
equations 6.32 and 6.33 showed that macroscopic quantities like the entropy 
are rather insensitive to small changes at the microscopic level it would be 
expected that the restriction on Ae r is not serious. 

The macroscopic state of the gas is defined by the number of particles (AO, 
the volume ( V ) and the allowed energy of the isolated gas ( E ). In fact it is only 
necessary to say that the energy is defined within some value SE, rather than the 
non-physical value of exactly E , since this small uncertainty will again (as in the 
case of the entropy) not affect the results for a large system. 

The macroscopic conditions are therefore summarised by the equations 

N = En r SN = = 0 (6.37) 

E = 2n r 5 E = Se r 5 n r = 0 (6.38) 

in exactly the same way as for the localised system. 

The number of distinguishable microstates for n r indistinguishable particles 
distributed over g r states is however quite different from equation 6.25 where 
the localised particles were treated as distinguishable, and also depends upon 
whether the particles are bosons or fermions. 

In a gas of bosons there is no restriction on the number of particles in a single 
state. The n r particles could be divided into g r groups by placing (g r — 1) parti¬ 
tions between the particles (figure 6.5). The total number of objects under 
consideration is now (n r +g r — 1). The total number of permutations is 
(n r +& ~ O’ but the permutations of the n r indistinguishable particles ( n r !) 
and the (g r — 1) indistinguishable partitions (g r — 1)! do not lead to a new 
microstate. The total number of distinguishable microstates is therefore 

(rt, + g r ~ 1)! 


n A(gr ~ 1)! 


(6.39) 
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Figure 6.5 Bose - Einstein counting. The particles can be divided into four groups by 
inserting three partitions. Therefore in general into g groups by inserting (g - 1) 
partitions. 


for the group of states r and 




= n r 


(n r + g r - 1)! 

n r '.(g r - 1)! 


(6.40) 


for the whole system. The argument leading to equation 6.39 is sometimes 
rather difficult to follow. A direct calculation of the states for a small number 
of particles (exercise 6.7) may give increased confidence in the correctness of 
the result! 

In a gas of fermions a maximum of one particle is allowed to be in each 
single-particle state. The first particle can therefore be distributed over the g r 
states in g r ways but the second in only {g r — 1) ways and so on. If the particles 
were distinguishable the total number of ways of distributing n r particles over 
g r states would be 


gri£r - 1) (gr - 2) . . . ( g r ~ «r + 1) = ;- 

(gr ~ n r )\ 

The number of states must of course be greater than or equal to the number of 
particles. Since the particles are indistinguishable the number of microstates for 
the system is 


w F = n, --(6.41) 
nA(gr - n r )\ 

The calculation of the most probable distribution can now be carried out using 
Stirling’s approximation provided that n r ,g r and (g r — n r ) may all be taken as 
much greater than unity 

In W B = 2 [(Wf + gr) l n ( n r gr) - In n r - g r In g r \ 

In Wp = 2 [(n r g r ) In ( g r - n r ) - «, In n, + g r In g r ] 
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- 6 In W B = 2 [In n r - In ( n r + g r )] 8n r = 0 

-5 In W F = 2 [In n r - In ( g r - n r )] 8n r = 0 

where the maximum must be found subject to the conditions given by equations 
6.37 and 6.38. As in section 6.4, two Lagrange undetermined multipliers are 
introduced to make the variations 8n r independent 


- 8 In W B = 2 [In n r - In (n r +g r ) + a 

r 

— 8 In W F = 2 [In n r - In ( g r -n r ) + a 

r 

and each term in the brackets must be zero. 

Hence for bosons 


n r 


_ gr _ 

Q OL + p€r _ i 


+ j3e r ] 8n r = 0 
+ Pe r ] 8n r = 0 


(6.42) 


and for fermions 


n r 


_ gr _ 

e a+ P*r + 1 


(6.43) 


The equation 6.42 was first derived independently by Bose and Einstein and 
equation 6.43 by Fermi and Dirac. They are usually called the Bose - Einstein 
and Fermi - Dirac distribution functions and will be considered in more detail 
in section 10.1. The Lagrange undetermined multiplier |3 is again equal to (kT)~ l 
as in section 6.4 and the multiplier a, at a given temperature, is determined by 
the normalisation condition 


N = 2 

r 


gr _ 

Q QL+Pe r + j 


(6.44) 


where the positive sign applies to fermions and the negative to bosons. This sum 
may be converted to an integral [because for a particle in a box the states of the 
system have been found to be very close together (see appendix IV)] using the 
density of single-particle states function 


N = 



D(e)de 
e <*+0e + 2 


(6.45) 


The density of states function derived in appendix IV then leads to 


N = 2ttK 



± 

e 2 de 


e <*+0e ± 2 


(6.46) 
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(ignoring the spin factor). This equation defines a at a given temperature, 
remembering that 0 = (kT )~ l , but cannot be solved in general in closed form. 

The general equations 6.44 and 6.46 will be discussed in section 10.1, but it 
is useful to obtain the classical limit of equation 6.46 at this point. The quantum- 
statistical result given in equation 6.46 may be expected to reduce to the classical 
Maxwell distribution when the mass of the particles is large, the number density 
of the particles is small (therefore when the volume of the gas is large for a given 
number of particles) and the temperature of the system is high. Under these 
conditions it may be seen from equation 6.46 that exp a must become much 
greater than one and the integral may be written 


N = 2nV 


= 2itV 



r 

J 0 


1 

62 de 



e de 


(6.47) 


The two quantum distributions have therefore become identical in the classical 
limit, as was to be expected since all perfect gases have the same equation of 
state regardless of the spin of the particles which make up the gas. The value of 
a may be found by putting the integral into standard form using the substitution 


x = j3e dx = P de 

l 

f (»oo j_ 

J o e-* dx 

The integral is a gamma function with value y/n/2 (appendix II). Therefore 


.. / ImkT \ 2 


N 


U-) 

V V 2-nmkT J 


(6.48) 


The classical limit requires that exp — a 1. The physical significance of this 
may be seen by writing equation 6.48 


F\T 
N 


h 2 \ 2 


2nmkT, 


(6.49) 


The left-hand side of the equation is the mean separation of the particles. The 
right -hand side may be shown to be of the order of the de Broglie wavelength 
of a typical particle from the equations 
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e = \mv 2 = —— 
2m 


h 2 

2 mX 2 


The mean energy of a gas molecule in a monatomic classical gas is 3kT/2 
(section 7.5) so 


\J(2me) 


h 

\ /(3mkT) 


(6.50) 


which is comparable with the right-hand side of equation 6.49. Quantum effects 
are therefore unimportant when the separation of the molecules is much greater 
than their de Broglie wavelength. 

The inequality given by equation 6.49 is satisfied by real gases at normal 
temperature and pressure (exercise 6.8). The deviations from the perfect gas law 
PV = RT discussed in section 2.2 are due to interactions between the molecules 
rather than to quantum effects associated with exp — a becoming small. (See 
also section 10.1.) 

The conclusion drawn from equation 6.49 that quantum effects are unimpor¬ 
tant when the mean separation of the particles is much greater than their 
de Broglie wavelength is physically reasonable on the basis of wave mechanics 
since under these conditions the ‘particle’ rather than the ‘wave’ aspect of the 
molecule will be dominant. 

An alternative way to view the transition from the two quantum gases to the 
single classical gas is to note that the exclusion principle will not have any 
practical effect on the selection of states for fermions if there are far more states 
available to the system than particles to fill them. The number of states per unit 
energy range increases with the volume of the box and the mean energy of the 
system increases with temperature. A combination of a low particle number- 
density and high temperature would therefore be expected to be favourable to 
the classical limit, in agreement with equation 6.49. When there are far more 
states available than particles (g r > n r ) equations 6.40 and 6.41 both reduce to 

W = ll r — g r n r (6.51) 

n r \ 

The most probable number of particles in the state r is now 

n r = e -(<*+Pe r ) (6.52) 


in agreement with the earlier calculation for localised particles but the number of 
distinguishable microstates is a factor of N\ less than the number for localised 
particles (equation 6.25) because the gas particles must be treated as indistinguish¬ 
able. In the earlier classical theory due to Maxwell and Boltzmann all particles 
were treated as distinguishable so equation 6.51 is sometimes called the corrected 
Maxwell - Boltzmann counting of microstates for non-localised particles. 
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The equilibrium properties of the quantum and classical gases could now be 
developed and in particular the classical perfect gas equation P = nkT could be 
derived so that the identity ]3 = (kT)~ l was established. This will be left until 
section 7.5 however after the general results of statistical mechanics have been 
obtained in chapter 7. 


6.6 CONCLUSION 

The approach to statistical mechanics discussed in this chapter may have been 
felt to be rather unsatisfactory. The use of Stirling’s approximation for example 
when not all the terms in the equation are very large is mathematically unjustifi¬ 
able and must involve the physical assumption that the neglect of small terms 
has no effect on the calculation of the macroscopic properties of the system. 

This was shown to be plausible for the entropy in the high-temperature limit 
where S is of order Nk but is more difficult to justify at low temperature where 
the entropy goes to some small value or to zero. 

The second curious feature of the calculations in this chapter is that the 
original macrostate was specified in terms of constant energy (to within some 
small range of energy) but once the system had been taken to be macroscopic it 
was possible to use the Boltzmann - Planck equation S = k In W(E) and the 
thermodynamic equation 

T \3 U )v \dEjv 

to define a temperature for the model system. Similarly in the more general 
cases of weakly coupled particles discussed in sections 6.4 and 6.5 a Lagrange 
undetermined multiplier (fi ) had to be introduced which can be shown to be 
equal to 1/kTby first deriving the perfect gas lawP = nkT and then recognising 
that in mutual thermal equilibrium all systems must have the same value of T 
(section 2.1). This transition required the identity of the Boltzmann entropy 
with the thermodynamic entropy, and of the total energy of the system with the 
thermodynamic internal energy. The approximation involved here (from using 
Stirling’s equation to find the entropy) becomes smaller as the size of the system 
increases. 

The limit in which 

V^co, (-'J (6.53) 

is called the thermodynamic limit. The thermodynamic theory presented in part 
I also involved the above limit since, for example, the internal energy was treated 
as an extensive quantity ( U=uV ) which is only justifiable if the atoms near the 
surface of the sample form a negligible fraction of the whole. Thermodynamics 
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and statistical mechanics should therefore lead to exactly the same results in the 
limit given by equation 6.53. 

The introduction of the temperature of a system as a derived property of a 
large isolated system is rather in conflict with the view of temperature as a 
fundamental thermodynamic quantity. A real experimental system is never 
completely isolated from its environment, more usually it is in contact with a 
heat reservoir at temperature T. In chapter 4 the transition from a thermo¬ 
dynamic isolated system to one at constant volume and temperature was found 
to lead to the Helmholtz free energy (rather than the entropy) as the funda¬ 
mental thermodynamic quantity. Similarly in the next chapter the relationship 
S = k In W(E) for an isolated system will be shown to be replaced by an equa¬ 
tion which is most simply expressed in terms of the Helmholtz free energy. 

The most unsatisfactory aspect of the method presented in this chapter 
however is that it is quite impossible to perform calculations for systems which 
are not weakly coupled. A strongly interacting system must be treated as a whole. 
The total energy E cannot be broken up into individual particle energies. As an 
example consider the interaction of the (N) molecules of a dense gas. The energy 
might be written 

N 

E = 2 m X V * + ' ' - r N) 

r -1 

The kinetic energy of the molecules is separable into single-particle energies but 
the potential energy of a given molecule depends upon the position of all the 
other molecules. Only when the potential energy term is zero (as in a perfect 
gas) is the total energy of the system separable into a sum of single-particle 
energies. 

The extension of the Boltzmann - Planck entropy concept to all systems, 
first made by Gibbs, is discussed in the next chapter and used to derive the 
general relationship between the atomic states of a system and thermodynamics. 
The general equations however can only be evaluated approximately for most 
strongly interacting systems. 

EXERCISES 

6.1 Maxwell suggested that a function f(v x )dv x existed for a perfect gas in 
thermal equilibrium such that Nf(v x )dv x was the number of molecules with 
velocity between v x and v x + dv x . He then assumed (a) the total distribution 
function for all directions can only be a function of the magnitude of the 
velocity (b) the velocity v x of a molecule is independent of the value of v y and 
v z . Show that the distribution function is then of the form A exp Bu x . What is 
the weakness of this derivation? 


6.2 Use the Maxwell distribution of velocities (equation 6.1) and the Boltzmann 
equation S = — kVH 0 to derive the entropy of a perfect gas. 
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6.3 Calculate the energy of the first excited state of an electron in a box of side 
0.01 m. A thermal energy may be defined by 6i = kT x . Find the equivalent 
temperature for the first excited state. Repeat the calculation for a hydrogen 
molecule. 

6.4 Find a general expression for the number of distinguishable microstates of N 
weakly coupled simple harmonic oscillators with individual energy e = ne 0 when 
the total energy is E. 

6.5 Find the probability P(e) of a particle being on each energy level in figure 
6.4(c). Compare this probability with the probability in the most probable 
arrangement. 

6.6 A crystal contains Adatoms, n of which, at temperature T, have been dis¬ 
placed from the body of the crystal to sites on the surface. Show that in thermal 
equilibrium, n «N exp (— j3e 0 ), where e 0 is the energy required to remove one 
atom, if (1 < n <N). 

6.7 Draw all the distinguishable microstates for two particles distributed over 
three states for (a) distinguishable particles (b) bosons (c) fermions and show the 
total is in agreement with the general equations given in the text. 

6.8 Show that exp - a < 1 in equation 6.48 for helium at N.T.P. In silver one 
electron per atom may be treated as free to move through the crystal. Consider 
these electrons as a gas and show that exp — a is not less than 1. 



7 Equilibrium Statistical Mechanics 


In this chapter we wish to consider a general system (with a fixed number of 
particles) in thermal equilibrium. The system may either be isolated from its 
environment (and therefore at constant energy) or be in contact with a thermal 
reservoir at temperature T. The system at constant energy will in fact never be 
completely isolated and should therefore be specified as having energy in the 
range E to E + 8E. The energy of the whole system of interest will be written as 
E. When weakly coupled systems are under consideration, the single-particle 
energy will be written e. 

The system will not remain in one particular quantum state indefinitely even 
when in thermal equilibrium but rather will pass through all the quantum states 
accessible to it as the result of small fluctuations in the surroundings. It is never 
possible for example to shield a system from gravitational effects. The weakly 
coupled system discussed in chapter 6 would in time pass through all the states 
shown in figures 6.3 and 6.4. 

Since macroscopic properties of a system in thermal equilibrium are indepen¬ 
dent of time it should in principle be possible to determine the probability of 
the system being in a particular quantum state (r) by making a series of measure¬ 
ments at successive times, provided that the time between measurements was 
sufficiently long relative to the longest relaxation time of the system. If the state 
r was found to occur m r times in M measurements the probability of the state r 
would be defined as 

P r = — ; NU oo (7.1) 

M 

XP r = - Sm r = 1 (7.2) 

r M r 

The limit in equation 7.1 is necessary since the true probabilities cannot depend 
upon the number of measurements made. Equation 7.2 simply means that the 
total probability of the system being in some quantum state is unity. Once the 
values of P r were known, say at temperature T , for a given system, the macro- 
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scopic properties could be deduced as averages over P r . The mean energy at 
temperature T would be 

E = 2 P r E r (7.3) 

where E r is the energy of the system in the state r. The time-average value of the 
energy given by equation 7.3 could now be identified (for a large system) with 
the thermodynamic internal energy of the system. 


7.1 ENSEMBLE AVERAGES 

The method of taking time-averages over the properties of a single system does 
not lend itself to calculation. An alternative approach to this problem was first 
advanced by Gibbs in 1901, using the methods of classical physics. A simplified 
version of the argument for quantum systems will be given here. 

The single experimental system with specified macroscopic conditions is 
replaced for purposes of calculation by M replicas of itself. The whole collection 
is called an ensemble . The M members of the ensemble are considered to be 
distributed randomly over the allowed quantum states of the system. The time- 
average of equation 7.1 is now replaced by an average over all the members of 
the ensemble at a single time. The averages taken in chapter 6 were therefore 
essentially ensemble averages at constant energy for the special case of a weakly 
coupled system with M equal to the number of distinguishable microstates of the 
system. 

There is no actual proof that the time-average given by equation 7.1 and the 
ensemble averages taken in chapter 6 and further discussed in this chapter will 
always agree. The relationship of the two averages continues to interest mathe¬ 
maticians but the agreement between experiment and the calculations of 
statistical mechanics using ensemble averages seems to be complete. 

When the experimental system of interest is considered to be essentially 
isolated from its environment it is replaced for purposes of calculation by an 
ensemble of identical systems each of which is completely isolated. The energy 
of each system must lie within the range E to E + bE and each system has the 
same volume and contains the same number of particles. Gibbs called this a 
microcanonical ensemble. 

The probability of occupation of a state r of the whole system is now found 
using the assumption of a priori probabilities (section 6.3): ‘an isolated system 
in thermal equilibrium is equally likely to be in any of the accessible states of 
the system.’ 

The probability of the particular state r is therefore 

P r = — — (E < E r < E + 8E) (7.4) 

W(E r ) 

= 0 


(otherwise) 


(7.5) 
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where W{E) is the number of states of the system with energy E or the density 
of states function times the energy range D(E)&E. Notice that it is the density 
of states of the whole system which is involved here. Only for weakly coupled 
systems is it permissible to write this in terms of the density of states of indivi¬ 
dual particles. 

The difficulty that arises when calculations are performed with the micro- 
canonical ensemble is that of finding the states of the system which lie in the 
allowed energy range. This is more difficult than enumerating all the states of 
the system regardless of energy. It is for this reason that the application of the 
microcanonical ensemble is limited to weakly coupled systems as discussed in 
chapter 6. 

Two approaches are possible when the experimental system of interest 
contains a constant number of particles at constant volume in contact with a 
thermal reservoir at temperature T. The entire system of reservoir and system of 
interest could be treated as isolated and the properties of the composite system 
calculated using the microcanonical ensemble. Alternatively we may consider a 
set of M replicas of the experimental system placed side by side in thermal con¬ 
tact and imagine the whole ensemble to be isolated from its surroundings. Gibbs 
called this a canonical ensemble. 

There is no restriction on the energy of the experimental system in this case 
since it may exchange energy with the thermal reservoir. Similarly in the 
canonical ensemble the energy of a single member of the ensemble may fluctuate 
by exchanging energy with its neighbours. The members of the ensemble are 
macroscopic and therefore distinguishable (figure 7.1). They are also weakly 
interacting because the ratio of the number of atoms near the surface to the 


T 



Figure 7.1 The experimental system at temperature T is replaced for purposes of calculation 
by an ensemble of replicas of the experimental system which are free to exchange energy. 
The energy of the whole ensemble is constant. 
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total number in the sample is always small for a macroscopic system. The 
allowed energy states for each member of the ensemble are therefore identical. 

The problem of M weakly coupled distinguishable objects was treated in 
section 6.4. The approximate result in that case was that the probability of a 
particle being in the state r was given by 

p r = e-ia+Per) (7.6) 

where exp — a is given by the normalisation condition 

2 P r = 1 = e" a 2 e-P'r (7.7) 

r r 

In the present ensemble average, M may be considered to be sufficiently large for 
equation 7.6 to hold exactly where the energy is now the energy of a member of 
the ensemble 


P r = e -(“ + ' 3£ » (7.8) 

t-Wr 
Z 

Z = 2 e~P £r 

r 

The function Z is called the partition function or sum over states. The factor /3 
will later be shown to be equal to 1/kT, as in chapter 6. 

The difference between the apparently similar equations 7.6 and 7.8 must be 
carefully considered. The equation 7.8 is the exact result for the probability of 
occupation of the state r of the whole system at temperature T. In general there 
is no way of simplifying this equation. When the particles which make up the 
system are weakly coupled it is possible to express the partition function of the 
whole system in terms of the single-particle energy states but only in the case of 
localised particles and the classical perfect gas does equation 7.6 then result. The 
case of weakly interacting non-localised particles is discussed in section 7.2. 

The equations 7.8 - 7.10 define the canonical ensemble probability of the 
state r of the whole system being found in an experiment. The mean energy of 
the system is defined as in equation 7.3 

2 

E = 2 P r E r = L - (7.11) 

r Ze~ 0Er 

r 

where E r is the energy of the system in state r. 

In the later sections of this chapter it will become apparent that the partition 
function contains all the information required to calculate the thermodynamic 
properties of any system with a fixed number of particles, but before discussing 
this in detail it is of some interest to consider two points concerning ensembles 


(7.9) 

(7.10) 
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in rather more detail. 

The first point concerns the number (M) of members of an ensemble. In the 
classical theory this was simply taken to be sufficiently large not to affect the 
calculations, just as in the number of samples in the time average of equation 
7.1. In the quantum theory however it is possible to give a more physical 
meaning to the ensemble by defining it as proposed by Kittel: ‘every quantum 
state accessible to the actual system is represented in the ensemble by one 
system in a stationary quantum state.’ The number of states is so large, if the 
actual system is always thought of as either macroscopic or as a small system in 
contact with a thermal reservoir, that no approximation is associated with this 
definition. This was however essentially the method used in chapter 6 and is 
therefore only approximately correct for a small system at constant energy. 

The second important point is that for large systems the calculated thermo¬ 
dynamic properties of a system are usually independent of the type (micro- 
canonical or canonical) of ensemble average. It was seen in chapter 6 that a large 
system with energy in the range E to E + 8E behaved as its own thermal reservoir. 
A temperature could be defined for the system using the thermodynamic 
equality 1/T= (dS/dU)v- It will now be shown that conversely a large system 
at temperature Thas a well-defined energy. The mean energy of the system is 
given by equation 7.11. The mean square fluctuation of the energy is found 
using the general statistical result 


(x - x) 2 = (x 2 - (x) 2 - 2xx) - x 2 - (x) 2 (7.12) 


Therefore 


- EE r 2 e- fiE r ZEre-^r 

(E - Ef = + -_ 

Z \ Z 


(7.13) 


This expression can be simplified by noting that the definition of the mean 
energy given in equation 7.11 could have been written 



(7.14) 


because the states of the system are independent of the temperature. Similarly 


E 2 


Z\W 2 / v 


(7.15) 


The derivative is taken with respect to jS rather than to temperature because of 
the simplicity of the resulting equation. The temperature derivative can of course 
be obtained from 


(7.16) 
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The derivative of the mean energy with respect to 0 at constant volume is 

/dE\ _ _ J_ (VZ\ + 1_ /a z\ 2 
VdjSyV Z \3|3 2 /k Z 2 \3]3 )v 

which is equivalent, using equations 7.14 and 7.15, to equation 7.13 apart from 
a change of sign, so 

cun 

The right-hand side of equation 7.17 may be related to the heat capacity of the 
system at constant volume since 

Cv = m - (M) = (a?) m 

V.9 Tjv \()TJv \dpJydT 

Therefore 

(E - Ef = kT 2 C v (7.18) 

The importance of energy fluctuations depends upon the ratio of the square root 
of equation 7.18 to the mean energy of the system 


V(g - E ? = s JjkT'Cy) 
E E 


(7.19) 


At high temperatures (the region of classical physics) the right-hand side of 
equation 7.19 is always of order l/\/N. The relative fluctuations about the mean 
energy are therefore very small for N of the order of 10 20 . The classical gas has a 
mean energy of 3NkT/2 and therefore a heat capacity at constant volume of 
3Nk/2. A solid at high temperatures has energy 3 NkT and heat capacity 3Nk 
(section 7.6). Fluctuations about the mean energy of a large system are only of 
importance at very low temperatures or when the thermal capacity becomes 
very large, as at a phase transition (section 10.4). 

The equilibrium properties of large systems with a fixed number of particles 
can therefore usually be calculated using either the microcanonical or the 
canonical ensemble. The canonical ensemble is the most convenient however 
because the partition function involves all the states of the system (weighted by 
the factor e - ^) whereas the microcanonical ensemble average requires the 
selection from all the states of the system of just those states whose energy lies 
between E and E + bE. 

The study of the fluctuations of a system about equilibrium does require a 
distinction to be made between isolated systems (whose energy cannot fluctuate 
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more than 8E) and systems with only a defined mean energy for which there is a 
finite (though usually very small) probability of the system passing through any 
energy states far from the mean energy. 

A further extension of the ensemble concept may be made to systems with a 
variable number of particles. For example a small part of the volume of a gas 
could be treated as the experimental system. The rest of the gas would now act 
as both a heat reservoir and a particle reservoir. The experimental system is 
replaced for purposes of calculation by an ensemble of systems which are in 
thermal contact and free to exchange particles. Gibbs called this a grand canoni¬ 
cal ensemble . The consequences of allowing the members of the ensemble to 
exchange particles should be clear following the discussion of the mean energy 
in the canonical ensemble. The number of particles in the system will now be 
variable but for a macroscopic system the fluctuations about the mean number 
may be shown to be small. The grand canonical ensemble may therefore be used 
to calculate the properties of a large closed system when the difference between 
N and N is unimportant, and is sometimes easier to use than the canonical 
ensemble. 

The probability of the system being in a state r with energy E r and a number 
of particles N r is given by 


Pr 


- (oiN r +(3E r ) 


(7.20) 


J? = 2 £ e -(aJV r +0E r ) 
N r 


(7.21) 


The function jr is called the grand partition function. 

There are computational advantages in performing statistical-mechanical 
calculations using the grand canonical ensemble rather than the canonical 
ensemble for systems with a constant number of particles and the grand canoni¬ 
cal ensemble is essential when the number of particles is allowed to fluctuate. 
The rest of this book however is restricted to the use of the microcanonical and 
canonical ensembles which are perhaps easier to visualise physically than the 
grand canonical ensemble. An excellent account of the use of the grand canoni¬ 
cal ensemble is given by Kittel (1969). 


7.2 THE PARTITION FUNCTION 

The partition function was introduced in equations 7.9 and 7.10 as a mere 
normalising factor for the probability ( P r ) of a member of the canonical 
ensemble being in a state of energy E r . It was apparent however from the calcu¬ 
lation of the mean energy and the mean square fluctuation about the mean 
energy that thermodynamic quantities can be expressed in terms of the partition 
function. In the next section it will be shown that the fundamental equation for 
any system in thermal equilibrium may be expressed in terms of the partition 
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function. In this section the general properties of the partition function and the 
special case of weakly coupled systems will be considered. 

The partition function expressed as 

Z = 2 e~@ E r (7.22) 

involves the sum over the distinguishable microstates of the whole system. 

The partition function may also be expressed as a sum over the energy levels 
of the systems and the degeneracies of the levels 

z = 2 g r e~ eEr ( 7 . 23 ) 

The form 7.22 will normally be used for simplicity but when actual calculations 
on a given system are to be performed it is necessary to remember that the 
degeneracies of the allowed energy levels are required (as in equation 7.23). 

When the energy levels are closely spaced relative to the thermal energy of the 
system it is possible to transform the sum in equation 7.23 into an integral. The 
degeneracy of a given energy level is now replaced by the density of states func¬ 
tion and the energy becomes a continuous function 

z = jo d E (7.24) 

The energy of the whole system will be a function of six variables (three for 
momentum and three for position) per particle, so equation 7.24 conceals a 
67V-fold integral with N typically 10 22 . 

A more general form of equation 7.24 may be written 

Z = — f e~ fiE d 3N p d 3N r (7.25) 

^ 3 N J 

where N is the number of particles, p the momentum and r the position 
(appendix IV). The notation d 3N p is used to stress the dimension of the 
the integral. The physical interpretation of equation 7.25 (as discussed in the 
appendix) is that in the 6N- dimensional space of momentum and position (the 
phase space) each quantum state occupies a volume of h 3N . This is an example 
of the Heisenberg uncertainty principle which restricts the simultaneous specifi¬ 
cation of momentum and position of one particle in one dimension (Ap x Ax) to 
order h. 

The importance of the weakly interacting systems discussed in chapter 6 
should now be obvious. Only for such systems can the immense complexity 
hidden in the apparently simple equations 7.22 - 7.25 sometimes be reduced to 
the relative simplicity of individual-particle states. 

The partition function for N weakly coupled localised (therefore distinguish- 
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able) particles may be written in terms of the sum over single-particle states 
rather than the states of the whole system. A single particle has a set of allowed 
states with energy e r . The effect of the other particles is contained in the tem¬ 
perature of the system or equivalently in j3. The partition function for a single 
particle (z) is 


Z = 2 e-fcr (7.26) 

and since the partition function for each of the N particles is identical 

Z = z N ( localised) (7.27) 

is the relationship between the single-particle partition function and the parti¬ 
tion function for the whole system for localised weakly interacting systems. 

This argument cannot be extended to weakly interacting systems such as gases 
in which the wave functions of the particles overlap. The particles must now be 
treated as indistinguishable and it is not possible to factorise the partition func¬ 
tion as in equation 7.27 except in the special case of the classical limit of the 
perfect gas. The effect of the particles being indistinguishable was shown to 
reduce the number of distinguishable states by N\ (equation 6.37) in the 
classical limit so the relationship between the individual partition functions and 
that of the whole system is, from equation 7.27. 

z? 

Z =— (non-localised, classical limit) (7.28) 

N\ 

The general case of the quantum gas is best treated using the grand partition 
function and will not be considered in this book. 

The microstates of a system in thermal equilibrium are independent of the 
temperature but may be a function of the volume. The probability of a state 
being occupied is a function of the temperature (from equation 7.9). The parti¬ 
tion function is therefore in general a function of temperature and volume. The 
fundamental equation for any system in thermal equilibrium at given temperature 
and volume must be expressed in terms of the Helmholtz free energy (section 
4.1). The relationship between the Helmholtz free energy and the partition 
function will be discussed in the next section. 


7.3 THE CONNECTION WITH THERMODYNAMICS 

In chapter 6 the connection between the statistical concept of the number of 
distinguishable microstates W(E) of a system with energy in the range E to 
E + 8E and thermodynamics was taken to be the Boltzmann - Planck equation 


S = k In W(E) 


(7.29) 
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Applying this equation to the M replicas of the system, with M r in the state r, 
which form the ensemble 

M\ 

as in section 6.4, since the members of the ensemble are macroscopic and there¬ 
fore distinguishable and using Stirling’s approximation 



= - Mk 2 P r In P r 

r 

The entropy of an individual member of the ensemble is therefore 

S = — k 2 P r In P r (7.30) 

where the values of P r are the appropriate ensemble probabilities for given 
macroscopic conditions. 

In the case of an isolated system in thermal equilibrium the microcanonical 
ensemble leads to 


W(E) 

S = - k 2 P r In P r = k In W(E) (7.31) 

as before, since there are W(E) identical terms of value 1/W(E) x In W(E). 

Equation 7.30 is sometimes taken to be the basic definition of the statistical 
entropy (rather than equation 7.29) and is extended to include systems which 
are not in thermal equilibrium. The thermodynamic entropy is of course still 
restricted to systems in thermal equilibrium. 

It is instructive to derive the canonical ensemble probability directly from 
equation 7.30. The probability of occupation of the states r must always satisfy 
the normalisation condition and in addition the mean energy of a system in the 
canonical ensemble is constant 


2 P r = 1; 2 P r E r = E = constant (7.32) 

The entropy of the system is defined by equation 7.30. In thermal equilibrium 
the entropy of the system must be a maximum. The probability of occupation 
of a state r in the canonical ensemble must therefore be such that equation 7.30 
is a maximum subject to the restraints of equation 7.32. As usual the method of 
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Lagrange undetermined multipliers then leads to 

55 = — k 2 (1 + In P r ) 5 P r = 0 

r 

25 P r f 0; 2E r 8P r = 0 

r r 

2 (a + pE r - In P r ) 5 P r = 0 

and since each term must separately be zero 

P r = e -(«+pi? r ) (7.33) 

The canonical distribution therefore leads to the maximum entropy for a 
system with given mean energy, that is to say corresponds to the state of 
thermodynamic equilibrium. 

The entropy defined by equation 7.30 may be written in terms of the 
canonical partition function by directly substituting into the equation 

n v 


Then 


k&'LE r e-P E * fclnZZe-^ 
r + V 
Z Z 

= k0E + k In Z 


(7.34) 


using the definition of the mean energy (equation 7.11) and the normalisation 
condition for the P r . Then expressing j3 = 1/kT 


or 



+ fclnZ 


(7.35) 


- kT InZ = - TS + E (7.36) 

The right-hand side of the equation is simply the definition of the Helmholtz 
free energy if the mean energy (E ) is taken to be equivalent to the thermo¬ 
dynamic internal energy. Therefore 

F = - kT In Z (7.37) 

is the fundamental equation from which all other thermodynamic functions can 
be derived using the results of part I. The form of equation 7.37 is correct since 
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the partition function is a function of volume and temperature. 

The problem of finding the fundamental equation for any system discussed 
in chapter 4 has therefore become: ‘evaluate the partition function of the system’. 
It is in fact only possible to do this exactly for a limited number of systems, but 
other systems (such as dense gases) may be treated by methods of successive 
approximation similar to the perturbation theory of quantum mechanics. 

The form of equation 7.37 shows that In Z is an extensive quantity (propor¬ 
tional to the number of particles in the system) as may be seen directly for the 
localised weakly interacting systems discussed in section 7.2 (equation 7.27) 

F = — kT\nZ = — NkTXnz (localised) (7.38) 


The thermodynamic functions of a system in thermal equilibrium may there¬ 
fore be expressed in terms of In Z. The functions divide into two groups, (a) 
those involving only the derivatives of In Z and (b) those involving In Z. In the 
first group are the energy functions such as the mean energy 


2E r e-P E r 

— r 

E =- 

Z 


ainZ \ 
dp )v 


(7.39) 


and the specific heat at constant volume 


'dE\ = /bE\ dp 
$t) v (dffJvdT 



(7.40) 


These quantities are unchanged if a constant term is added to the logarithm of 
the partition function. The distinction between the partition functions of 
localised and non-localised particles (equations 7.27 and 7.28) is therefore not 
important when the mean energy or the specific heat are under consideration. 
However the group of functions which involve In Z itself, such as the entropy 
and the Helmholtz free energy, do change if In Z changes by a constant. The 
distinction between equations 7.27 and 7.28 is then of vital importance, as will 
be seen in section 7.5. 

A further distinction between the mean energy and the entropy of a system is 
that the mean energy is only defined to an arbitrary constant but the entropy is 
independent of the choice of constant. Suppose that the position of the energy 
zero is changed by an amount E 0 . Then the new partition function becomes 

Z' = = e-* £ o Ze~P Er 

r r 

= Ze-^« 


Therefore 


InZ' = InZ - PE 0 
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The mean energy becomes (from equation 7.39) 

E 7 = E + E 0 

but the entropy (using equation 7.35) is unchanged since 

= fclnZ' + kf3E' 

= klnZ + kfiE = S 

The concepts introduced in this section will now be applied to the particularly 
simple systems of localised weakly interacting systems discussed in chapter 6 and 
to the classical perfect gas. The advantages of working with the canonical 
ensemble rather than the microcanonical ensemble should be quickly apparent. 

7.4 LOCALISED SYSTEMS 

The partition function for any weakly coupled localised system involves simply 
the single-particle states through equation 7.27 

\ N 

The A'-particle two-level system with single-particle non-degenerate energy 
levels 0, e reduces to 



Z = z N = (1 + <>*)** 

In Z = N\az = N\a{\ + e"**) 


The number of particles in each level is then 


N e = NP e 
N 0 = NP 0 


N 

1 + e -(3e 
N 

1 + e-0 e 


N e = N 0 e -(3e 


(7.41) 

(7.42) 

(7.43) 


in agreement with the approximate calculation of the most probable number 
of particles in each state given in equation 6.14. The results in equations 7.41 - 
7.43 are the exact results for the mean number of particles in each state at 
temperature T. 

The mean energy of the system may be found from equation 7.11 


- Wee-* 36 _ Ne 

~ 1 + e-0 e ~ 1 + e^ 


(7.44) 
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and the heat capacity at constant volume by differentiating equation 7.44 with 
respect to temperature 


C v 



e* e 

(1 + e ^) 2 


(7.45) 



Figure 7.2 The mean energy and heat capacity of a two - level system as a function of 
temperature. 


The mean energy and the heat capacity of the two -level system are shown as 
functions of the reduced temperature ( kT/e ) in figure 7.2. The entropy of the 
two-level system may be found from either equation 7.35 or the thermodynamic 
relation 


where 


S = 


df\ 

dTjv 


/d0\ 

Wv W 


F = - kTlnZ = - - In(1 + e"^) 

P 


Therefore 
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In the low-temperature limit (T < elk) the entropy approaches zero as was 
noted in chapter 6 and in the high temperature limit ( T>e/k ) the entropy 
approaches Nk In 2. The entropy is shown as a function of reduced temperature 
in figure 7.3. 

The important point, already stressed in chapter 6, is that the properties of 
the system depend upon the ratio (e/kT). The ‘high-temperature limit’ does not 
therefore mean necessarily a large number on the thermodynamic scale but 
simply that the energy gap (e) is smaller than the thermal energy ( kT ). A 
quantised system with sufficiently small e may therefore be in the high-tem¬ 
perature region at all temperatures accessible to experiment. In this case the 
entropy of the two-level system is Nk In 2 independent of temperature and the 
specific heat at constant volume, TX$SfoT)y is zero (as shown in figure 7.2). 



Figure 7.3 The entropy of a two - level system as a function of temperature. The value 
Nk In 2 is shown as a broken line. 


The simplest example of a two-level system is a nucleus of spin-§ in a mag¬ 
netic field (figure 7.4). The energy level is two-fold degenerate in zero field but 
separates into two non-degenerate levels in a magnetic field {jx 0 H *) with energies 
±IJi 0 H* where n is the magnetic moment of the nucleus. The energy zero is taken 
at zero field value and the splitting of the levels (A) is given by 

A = 2n 0 H* 

The single-particle partition function is now 


z = e-0*/ 2 + eP A l 2 


(7.47) 
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The mean energy of the system is 


E = MU 1 - e<?A > 

2 (1 + e^ A ) 


(7.48) 


and the heat capacity is unchanged from equation 7.45. The nuclear magnetic 
moment is so small that in most cases the maximum magnetic field available 
will leave A/kT small at normal experimental temperature (exercise 7.1). The 
specific heat is then found to be proportional to the inverse square of the tem¬ 
perature (exercise 7.2). The measurement of nuclear specific heats [when they 
are large enough to be distinguished from other contributions to the specific heat 
(section 10.3)] has been of particular importance in investigating the internal 
magnetic fields found in ferromagnetic materials such as cobalt. In these materials 
a large effective magnetic field is found at the nucleus even when the external 
field is zero. 



Figure 7.4 A particle with spin 1/2 has a doubly degenerate energy level in zero magnetic 
field. The level splitting is proportional to the magnetic field. 


The partition function for N weakly coupled simple harmonic oscillators is 
also straightforward. The energy levels of the individual oscillators are given by 
quantum mechanics to be 

e = (« + y) e 0 ; e 0 = hv 

where n is an integer (0,1,2,...), h is the Planck constant and v the frequency of 
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the oscillator. Then the partition function for a single particle is 

z = 2 e” (/l ^^ e ° = e~P € ol 2 2 e ~" e ° 
n n 

= e ~0e o /2 (1 + e -/3e 0 + e ~2^e 0 + ) 

The term in the bracket is an infinite geometric series with the value 1/(1 — e~^ € °) 
so 


e-^0/2 

1 - e _/3e ° 


and 


In Z 


Nlnz = - N 



+ In (1 - 


e 



The mean energy is 


E = 


/ 81 nZ\ 

VWJv 


Ne 0 



which is equivalent to 


(7.49) 


(7.50) 


(7.51) 


E 



(7.52) 


At low temperature the mean energy therefore goes to Ne 0 /2. This term arises 
from the condition that e = (n +\)e 0 rather than the value ne 0 given by the old 
quantum theory. The term Ne 0 l2 is called the ‘zero point energy’ and its effect 
is to preserve at low temperature the uncertainty in the position of the oscillator 
required by the Heisenberg uncertainty principle. In the high-temperature limit 
(j3e 0 <l) the mean value of the energy becomes NkT. This is an example of the 
equipartition theorem which applies in the classical high-temperature limit 
(section 7.6). 

The heat capacity of the system follows immediately from equation 7.51 


C v = 



= Nk e 0 2 0 2 



(7.53) 


and is shown in figure 7.5. The low -temperature limit (j3e 0 ^ 1) is 


C v = Nke 0 2 0 2 


(7.54) 
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which decreases exponentially to zero as 71* 0 k. In the high-temperature limit 

(to < i) 


C v = Nk 


(7.55) 


The high temperature specific heat therefore goes to a constant value (Nk) 
rather than to zero, as does the two-level system. This is because the two-level 
system cannot, at positive temperatures, have more than N/2 particles in the 
upper state (section 10.5). Since the mean energy of the two-level system cam 
not increase beyond Ne/2, the heat capacity which involves the derivative of E 
goes to zero at high temperature. 



Figure 7.5 The heat capacity of a system of weakly interacting one-dimensional simple 
harmonic oscillators as a function of temperature. 


The simple harmonic oscillator is of particular importance in the theory of 
black-body radiation (section 10.2). It also serves as a crude model for the 
lattice specific heat of solids (the Einstein model) in which each atom is con¬ 
sidered to vibrate independently (section 10.3). 

One further simple example of a localised system will be considered in this 
section. The thermodynamic properties of rubber were seen in section 5.6 to be 
most unusual since the length of a rubber fibre at constant tension is found to 
decrease with an increase of temperature. A very simple model to account for 
the elastic properties of rubber considers the molecular groups as forming the 
links of a chain. A one-dimensional model would therefore separate the links 
into two groups (figure 7.6) which will be denoted A if they act to increase the 
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Figure 7.6 A one-dimensional model for rubber. The links may point either to the right or 
left without change of energy. In the diagram the links have been displaced from the 
original line in order to clarify the arrangement. 


length of the chain and B if they act to reduce it. The energy of a link is inde¬ 
pendent of its orientation and will be taken to be zero. This model is plausible 
since rubber can be stretched to about ten times its length without exceeding 
the elastic limit. The partition function 

z = Ze-Wr 

r 

involves only one term since E r is equal to zero. The degeneracy of this term if 
there are N links with n A in group A and n B in group B is then simply the 
partition function 


n A \n B \ 

N = n A + n B (7.56) 

The length (. L ) of the chain is given by 
L = (/i a - ”b)1 

where / is the length of one link. The Helmholtz free energy is 

F = - kT In Z = - kT (N In N - n A In n A - n B In n B ) (7.57) 

where Stirling’s approximation has been used. 

The tension (.90 in the chain is given (assuming that the volume change of 
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d«A 
d L 



n A 


«B / 


(remembering that dn B = - d«A from equation 7.56) 


^= —In 
21 



(7.58) 


where L M =M is the maximum length of the chain. When the tension is small 
(L <^Z, M ) the equation becomes 




kTL 


IL 


M 


or 

gr 

L = constant — (7.59) 


The length of the fibre therefore decreases when the temperature is raised at 
constant tension. 

The entropy of the fibre is given (from equation 7.57) by 

S = k(NlnN - n A lnn A - « B ln/2 B ) (7.60) 

(remembering that F=U- TS and in this case U is zero). The entropy is zero 
when the chain has maximum extension (n A =N,n B = 0) and a maximum when 
the extension is zero ( n A = « B ). The tension in the fibre arises not from any 
change in the internal energy with length (as in the case of a metal wire) since 

(--) - ° 

\bLjT 

but from the need to overcome the entropy-driven tendency of the fibre to curl 
up. 

This one-dimensional model of rubber cannot be expected to give exact 
agreement with experiment but does show the main features of the behaviour of 
a rubber fibre or a polymer. A three-dimensional treatment is also possible but 
also fails in detail because the volume change of the fibre cannot be neglected. 
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7.5 CLASSICAL PERFECT GAS 


The partition function for systems in which the energy levels are sufficiently 
close together was shown in section 7.2 to be 

Z = J” D(E ) e~ pE d E 

In the special case of the classical monatomic perfect gas the partition function 
may be factorised as discussed in section 7.2 to 


Z = 


N\ 


z 


= D(e) e de 


(7.61) 


where z is the single-particle partition function. The density of states function 
(apart from the spin factor) given in appendix IV then leads to 



(7.62) 


where m is the mass of one molecule of the gas. The Helmholtz free energy of 
the whole gas is then 


F = — kTXnZ = - kT In (— \ 

\N\) 0 

= - Mrjln V-\ In(3 +yln ^^-lnW+lJ 

using Stirling’s approximation for In /V!. 

All the properties of the monatomic perfect gas can now be derived from 
equation 7.63. The equation of state is given by 


P 



N 

pv 


= nkT 


(7.64) 
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Notice that this result confirms the identity ft = 1 jkT. The mean energy may be 
found using equation 7.39 or more simply using the identity (equation 4.6) 


U = E = - T 2 



v 


Therefore 


E = | NkT (7.65) 

The internal energy of a classical perfect gas is independent of the volume of the 
system, in agreement with Joule’s (experimental) result 

= 0 (perfect gas) (7.66) 


3£A 

dv) 


which was used throughout part I. 

The mean energy of the perfect gas is related to the equation of state, using 
equation 7.64 and 7.65, by the equation 

PV = I E (7.67) 

This equation is useful because it is also applicable to the Fermi - Dirac and 
Bose - Einstein gases (section 10.1). The factor of 2/3 is not however correct for 
particles whose velocity is of the order of the speed of light, where the Einstein 
relativistic equation relating energy and momentum is required (exercise 7.4). 

The heat capacity at constant volume is simply 



The mean square fluctuation in the energy of a monatomic perfect gas is (from 
equation 7.17) 


(E - E) 2 = kT 2 C v = | Nk 2 T 2 
and the importance of fluctuation is, fiom equation 7.19 


[(E E) 2 ] 2 ,2 

E V 3 N 


(7.69) 


The mean energy of a classical perfect gas is therefore usually extremely well 
defined since N is typically ~ 10 20 . 
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The entropy of a monatomic perfect gas (from equation 7.63) is 

i= -©K-“G inr+i "(^)] +s ” (7 - 70) 

where S 0 is a constant for a given gas 

o r ^ (2irmk\ 5 1 

s ° = “U ,n + d (7J1) 

This is the Sackur - Tetrode equation for the entropy of a monatomic perfect 
gas in the classical limit. 

The entropy of the perfect gas is in agreement with the thermodynamic result 
(equation 3.41) but also contains a definite value for the constant S 0 . Since S 0 
contains the Planck constant it could not have been arrived at by the methods of 
classical physics. It is in this sense that we refer to the perfect gas as the classical 
limit of the quantum gases rather than as simply the classical perfect gas. 

The original classical theory of the perfect gas contained an arbitrary constant 
instead of the Planck constant and, more seriously, used the partition function 
for weakly coupled distinguishable particles 

Z = z N 

The entropy given by equation 7.70 then contains In V rather than In ( K/TV) and, 
as discussed in chapter 3, this form for the entropy is not extensive and wrongly 
suggests that the mixing of identical gases leads to an increase in the net entropy 
of the system. This problem was first appreciated by Gibbs and is sometimes 
called the Gibbs paradox. Gibbs introduced the factor TV! into equation 7.61 to 
bring statistical mechanics into agreement with thermodynamics, but its physical 
significance did not become apparent until quantum mechanics showed that the 
particles of the gas must be treated as indistinguishable. 

The Sackur-Tetrode equation for the entropy of a perfect gas may be 
verified directly by experiment. The total change in the entropy of an element 
from the solid state near 0 K to the liquid phase and then to the vapour may be 
measured directly (as discussed in chapter 3). 

AS = f J m C s d (In 7) + TL + fj’ b C, d (In T) + ^ (7.72) 

J o T m JT m Tb 

where C s is the heat capacity of the solid, T m the melting point, / f the latent 
heat of fusion, C x the heat capacity of the liquid, T h the boiling point and / v 
the latent heat of vaporisation. The value given by experiment using equation 
7.72 may then be compared directly with the calculated value from equations 
7.70 and 7.71 which contain no adjustable parameters. The agreement is found 
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to be extremely good and is perhaps the single most convincing verification of 
the consistency of thermodynamics and the method of ensemble averaging used 
in statistical mechanics. 

The entropy given by equation 7.70 is correct for a monatomic gas of particles 
with no nuclear or electronic spin. When the nucleus of the atom has spin / there 
will be (21 + 1) degenerate states per atom or (21 +1)^ states for the gas. The 
entropy will then increase by Nln(2I +1) but the mean energy of the system is 
unchanged. Similarly when the electron spin of the atom is s , the entropy will 
increase by AHn(2s + 1) and a gas of atoms with both nuclear spin I and atomic 
spin s will have a total entropy of N[\n(2I + 1) + ln(2s + 1)] greater than that 
given by equation 7.70. 

The energy of a monatomic perfect gas is due to the kinetic energy of the 
atoms 


N 

r -1 

where p r is the momentum of the rth atom. The only other possible contribu¬ 
tion to the partition function is that of the electronic states of the atom but the 
separation between the ground state and first excited states of the atom is so 
large (exercise 6.3) that at normal temperature the atom will always be in the 
ground state. 

The energy of diatomic or polyatomic gases is due to the kinetic energy of 
the whole molecule and also to the internal degrees of freedom of the molecule 
such as rotation and vibration. The frequencies of vibration and rotation of a 
molecule are so different that they have little effect on each other. The partition 
function for the whole molecule can then be written (ignoring the spin terms) 

Z — ZfZj-QtZy 

where z t is the translational (kinetic) partition function (equation 7.62), z rot 
the rotational partition function and z v the vibrational partition function. This 
separation follows immediately (if the energies are independent) from 

z = 2e“^ Cr = 2 e~^( e t +e rot +e v) 
r 

The relationship between the single -molecule partition function and that for the 
whole gas of N molecules is then 

Z= (~) Z ^ N Z * N ( 7 - 73 ) 

where the first term is identical with the result for a monatomic gas. The 
electronic partition function could also be included in the product on the 


1 iV 1 
1 r= 1 


2m 
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right-hand side of the equation if the temperature of the system was sufficiently 
high for a significant number of molecules to be raised above the ground state. 

The rotational energy states of a rigid system of two molecules (which is a 
goo & approximation to the behaviour of real molecules) are given by quantum 
mechanics to be 


^rot 


h 2 

8tt 2 / 


J(J + 1) 


(7.74) 


where J is an integer (0,1,2,...) and / the moment of inertia of the molecule about 
the centre of mass. The degeneracy of a rotational energy level is (2 J +1). 

The partition function is therefore 

• J(J+i)h 2 p 

z rot = 2 (2/ + 1) e (7.75) 

/=0 


The mean rotational energy is 


^rot 


Nh 2 00 _ 

-Y /(/+ 1) (2J + 1) e 87 r 2 I 

~ _ IM±1 

£ (2J + 1) e 

/=0 


(7.76) 


This expression and that resulting from the heat capacity are difficult to handle, 
but in the low-temperature limit (P > S7T 2 I/h 2 ) it is clear that only the first term 
will contribute to the sum 


E 


rot 


3Nh 2 - 

- e 

4n 2 I 


h 2 p 
4 t r 2 I 


(7.77) 


The mean rotational energy therefore approaches zero at low temperature 
(where the low-temperature region is defined by T < h 2 lSn 2 kr) and the rota¬ 
tional contribution to the heat capacity also goes to zero at low temperature. 

A gas of diatomic molecules will in principle have the same heat capacity as a 
monatomic gas at sufficiently low temperature but in practice this effect is only 
observed for hydrogen. 

The importance of a characteristic temperature (related to the spacing of the 
energy levels by the equation kT g = A E) for quantised systems was discussed 
briefly in section 6.3 in connection with the two-level system. It should now be 
clear that in general a system will have a number of characteristic temperatures 
such that at a given experimental temperature some aspects of the system may 
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be in the high-temperature limit while others are in the low-temperature limit. 
The spacing of the translational energy states in a box of macroscopic dimensions 
is so small that for a gas in the classical limit these states may always be treated 
in the high-temperature limit and the partition function treated as an integral* 
rather than a sum. The vibrational (and the electronic) energy states are so 
widely separated that at normal temperature only the ground states are occupied. 
These two aspects of the molecule are therefore usually in the low-temperature 
limit. Finally the rotational states of heavy molecules can usually be treated in 
the high-temperature limit, which will now be evaluated. The lightest molecules 
(H 2 , HD) are in the transition region between the two limits below about 100 K. 

The high-temperature limit of the mean rotational energy may be found by 
evaluating the sum in equation 7.76 for a large number of terms but the correct 
result for the heat capacity may be found more simply by noting that the density 
of rotational states per unit energy range is given by 

Number of states 4(/+1) 

Energy change 2(/+1) ( h 2 /87r 2 /) 

(considering the degeneracies and energy separation of two adjacent states). 

Therefore in the high-temperature limit (j3 < &ir 2 I/h 2 ) where the thermal 
energy is much greater than the separation between the energy levels, the parti¬ 
tion function (equation 7.75) may be transformed into an integral treating the 
density of rotational energy states as constant. 

Zrot = constant e -/3e de 

The mean rotational energy of a molecule in the high-temperature limit is 
then 

r ee^ c de 

= —-= kT (7.78) 

f e~P € de 

J o 

The rotational specific heat per molecule is then simply (97/3 T) which is equal 
to k. The total heat capacity of a diatomic gas in the high-temperature limit is 
therefore 


C v = |M + Nk = \Nk (7.79) 

or SRj2 for 1 mole of the gas, but the heat capacity decreases to 3R/2 in the 
low-temperature limit. 

The vibrational and electronic contributions to the heat capacity are not of 
importance at normal temperature and will not be considered further. The 
rotational partition function (equation 7.75) becomes more complicated for the 



Equilibrium Statistical Mechanics 147 

case of molecules of identical atoms, and in particular for molecular hydrogen, 
but will not be considered further. It should be clear from this section however 
that the rotational and vibrational specific heats could not be understood in terms 
of classical physics. 

The heat capacities of three very different systems have now been seen to 
take on the same form in the high-temperature limit. The simple harmonic 
oscillator (3 k), the free particle (3k/2) and the diatomic molecule (5k/2) all have 
heat capacities which are independent of temperature and of the same order of 
magnitude. These are examples of a general theorem of classical physics called 
the law of equipartition of energy which will be discussed in the next section. 


7.6 THE EQUIPARTITION OF ENERGY 

The simplest form of the equipartition theorem may be stated: ‘a classical 
system in thermal equilibrium at temperature T has a mean energy of JcT/2 per 
degree of freedom. One degree of freedom corresponds to each squared term in 
the expression for the energy of the system’. 

A few simple examples will be discussed before proving the general theorem. 
The energy of one molecule of a monatomic perfect gas may be written 

e = \mv 2 - \m{v x 2 + v y 2 + v z 2 ) 

since it is free to move in three dimensions. The mean energy is therefore 3kTj2 
and the specific heat 3k/2, in agreement with equation 7.68. 

The diatomic molecule has to be considered to have only two degrees of 
rotational freedom (which cannot of course be proved in classical physics) and 
then has the required value of 5k/2. It is interesting to note that Maxwell and 
Boltzmann were not impressed by the apparent agreement between the measured 
specific heats of diatomic molecules and the equipartition theorem and stressed 
that to consider a molecule to have only two internal degrees of freedom was an 
unwarranted assumption. The quantum-mechanical calculation discussed in the 
last section should have made it clear that the other degrees of freedom are not 
excited at normal temperature and do not contribute to the specific heat. 

The equation of a one-dimensional simple harmonic oscillator is 

e = \mv x 2 + \cx 2 

where c is a constant. In the classical limit the mean energy will be (kTj2 + kT/2) 
since both terms contain a square. This result agrees with the high-temperature 
limit calculated in section 7.4 (equation 7.55) but at low temperature the mean 
energy was seen to go to the zero-point energy, not to zero as suggested by the 
equipartition theorem. 

The equipartition theorem can be stated in a more general form than that 
given at the beginning of this section and since the general proof is no more 
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difficult it will be derived first and then various special cases considered. The 
classical partition function for the system will be written so as to distinguish a 
term in the energy (Ef) which may be a function of either momentum (p x ) or 
position (*)• When E 9 is a function of momentum 

e -P(.E + E') d 3N p d 3N r= J*e - ^ d 3Ar-1 p d 3N rJe~^ E> dp x (7.80) 

where the first integral on the right-hand side excludes d p x . The second term on 
the right-hand side is now integrated by parts to give 

Je-^' d Px = J + pfpx d p x 

The first term on the right -hand side will be zero at both limits, assuming that 
the energy goes to infinity as the momentum goes to plus or minus infinity. 
Therefore 


1 

P 


'-“'ip, 

\dPxJ 


L 


e -fiE d 3 N p d 3/V,. 


(7.81) 


but the right-hand side is simply a mean value so 


Px 



1 

P 


kT 


which may be written 

Px»x = kT 


since 

d E' 

-= v x 

dp x 


(7.82) 


(7.83) 


The general result given in equation 7.82 may be simplified when the energy is 
proportional to the square of the momentum 

E' = ap x 2 


to give simply 


E' = \kT 


(7.84) 
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in agreement with the first statement of the equipartition theorem. 

The energy E 9 may refer either to a single-particle energy in a weakly coupled 
system (as in the examples given at the beginning of this section) or else to the 
energy of the system as a whole. For example the energy of a galvanometer 
movement with moment of inertia / suspended from a fibre with torque constant 
c may be written 


E = \c0 2 + \l(0 ) 2 (7.85) 

where 0 is the angle of rotation and 0 the angular velocity (d0/d t). The mean 
energy of the suspension is therefore (kT/2 + kT/2) or kT. 

When the energy term E 9 is considered as a function of position, an equation 
similar to equation 7.81 may be obtained provided the energy dependence is 
such that on integrating by parts to obtain 

* e-**' dx 

the first term on the right-hand side vanishes at the limiting values of x. This 
means that the energy must increase without limit as b goes to plus infinity and 
a to minus infinity or that the lower limit is restricted to x equal to zero. A 
bound particle (such as a simple harmonic oscillator) satisfied these conditions. 

The equivalent equation to equation 7.82 when the energy E ' is a function of 
position is then 




which may be written 


(7.86) 


displacement x opposing force = kT (7.87) 

In the special case that E' is proportional to x 2 the simpler result 

E* =\kT (7.88) 


is again obtained. 

The equipartition theorem is obviously a powerful tool, particularly since it 
can be applied to macroscopic bodies, but is restricted to situations which may 
be described by the classical limit of quantum mechanics. The use of the equi¬ 
partition theorem to obtain the mean square fluctuation about the mean energy 
of a system in thermal equilibrium will be discussed in the next section. 
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7.7 FLUCTUATIONS ABOUT EQUILIBRIUM 


The general result for the mean square fluctuations of a variable x was given in 
equation 7.12 

(x - x) 2 = x 2 - (x) 2 

The value of x 2 can often be found using the equipartition theorem and is equal 
to the mean square fluctuation of the system if x is zero. The mean square 
angular rotation of the galvanometer suspension with torque constant c for 
example (equation 7.85) is given by 

\ c 0* =\kT 

The mean angle 0 is obviously zero since the suspension is equally likely to turn 
clockwise or anticlockwise from the equilibrium position so 

V(A~- 8? = 7— (7-89) 

c 

A suspension at temperature T will rotate randomly about the equilibrium 
position such that the root mean square amplitude of the angular displacement 
is given by equation 7.89. This fluctuation in the angle 0 will mask the rotation 
due to a small steady current I. The rotation due to the current is proportional 
to cB so the minimum observable current is given by 

/min « y/(cKT) (7.90) 

The thermal fluctuations (noise) can only be reduced for a given suspension by 
reducing the temperature of the system. 

The thermal noise present in an electronic amplifier may be analysed in a 
similar fashion to the galvanometer suspension and is seen to provide a funda¬ 
mental limitation to the size of signal that can be detected. The amplifiers used 
in radioastronomy and other subjects where the signals are extremely weak are 
sometimes immersed in liquid helium (4.2 K) to achieve a theoretical improve¬ 
ment in performance of \/(300/4.2) compared to the same amplifier at room 
temperature. There are many other problems apart from thermal noise in the 
design of amplifiers to detect small signals, but thermal noise is the final limita¬ 
tion on the performance of the system. 

The mean square fluctuation in the velocity of a gas molecule in a classical 
gas at temperature T is given immediately by equation 7.12 since the mean 
velocity is zero (remember that velocity involves magnitude and direction and 
there can be no preferred direction in the gas in thermal equilibrium) so 

-_ 3 kT . 

(v - v) 2 = v 2 = - (7.91) 

m 
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The mean square fluctuation in the velocity increases with temperature and is 
inversely proportional to the mass of the particle. 

The equipartition theorem and the general equation 7.12 can therefore be 
used to obtain useful information about the mean square fluctuations about 
equilibrium of any system which is in the classical regime and has a mean value 
of zero for the property under consideration. The distribution of fluctuations 
however is not characterised by equation 7.12 alone and a complete theory for 
a particular system is required when more than just the mean square value of the 
fluctuation about the equilibrium value is required. The Maxwell velocity dis¬ 
tribution for the molecules of a classical perfect gas has a Gaussian form (equa¬ 
tion 6.1 and chapter 8) with mean value zero and mean square deviation given 
by equation 7.91 but the Gaussian form could not be deduced from equation 
7.12. 

The general equation for the mean square fluctuation of the thermal energy 
of any system in thermal equilibrium at temperature T (equation 7.18) 


(E - Ef = kT 


has already been discussed in section 7.1. The ratio of the root mean square 
fluctuation of the energy to the mean energy was seen to be of order 1 / 'y/N in 
the region where the equipartition theorem could be applied. 

Fluctuations are of importance however when Cy becomes very large, as at the 
critical point for a phase transition, such as liquid to vapour (section 10.4). The 
thermodynamic result (equation 5.11) 


Cp — Cy — 



and the condition (equation 2.13) 



(7.92) 


at the phase transition show that (Cp - Cy) goes to infinity at the critical point 
and from experiment it appears that Cy also approaches infinity, although more 
weakly than C P (section 10.4). The isothermal compressibility of the system goes 
to infinity at the critical point (equation 7.92) and large fluctuations occur in 
the density of the system. This is to be expected since, for example, as the gas is 
cooled droplets of liquid will begin to form in the vapour. The grand canonical 
ensemble may be used to prove that the mean square fluctuation in the number 
of particles in volume V is given by 


(N - Nf 


-kTN 2 

\a vJt 


(7.93) 
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The importance of fluctuations is given by 



(7.94) 


The right-hand side is simply equal to \/y/N for a perfect gas. Fluctuations in 
the number density are therefore unimportant (except at the critical point when 
(9P/9 V) T goes to zero) unless the volume V under consideration is very small. 


7.8 CONCLUSION 


The problem of the thermodynamic fundamental equation discussed in part I 
has been shown in this chapter to be equivalent, for a system with a fixed 
number of particles, to the evaluation of the partition function of the system. 
The fundamental equation may then be written in terms of the Helmholtz free 
energy and the partition function of the system 


F = - kTkiZ 

(7.95) 

Z = £ t~& E r 

(7.96) 


The partition function is a function of temperature (since j3 = 1/kT) and of the 
volume of the system if the states r depend upon the volume, so the right-hand 
side of equation 7.96 is a function of ( T , V) as was required if the Helmholtz 
free energy was to define the fundamental equation of the system. 

Statistical mechanics goes beyond thermodynamics however in that it pro¬ 
vides a means for the study of the fluctuations of the properties of the system 
about their equilibrium values. When the fluctuations are small (as is usually the 
case for large systems) the mean value of the energy of the system has been seen 
to be well defined and may be treated as the thermodynamic internal energy 
discussed in part I. 

The original statistical definition of the entropy due to Boltzmann 
was seen to be correctly extensive but did not prohibit a decrease in the entropy 
of an isolated system in thermal equilibrium. The probability of a significant 
(macroscopically observable) decrease in the entropy of a large system was 
however found to be vanishingly small. 
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The Gibbs definition of the entropy 

S = - k'LPrlnPr (7.97) 

avoids this difficulty of principle inherent in the Boltzmann entropy since the 
ensemble probabilities ( P r ) are independent of time for a system in thermal 
equilibrium. The entropy defined by equation 7.98 cannot therefore decrease. 
The third law of thermodynamics in the form (equation 4.22) 

S->s 0 T^o k 

where S 0 is a constant, is equivalent to the statement that the sum on the 
right-hand side of equation 7.98 becomes independent of temperature at 
sufficiently low temperature. If the lowest energy level of the system is #-fold 
degenerate then 


So = king (7.98) 

which is certainly negligible by macroscopic standards (« Nk ). As was remarked 
earlier it is thought that the true ground state of any system is non-degenerate 
(g = 1) and then S 0 is equal to zero, which was the strongest statement of the 
third law given in chapter 4. This simple formulation of the third law is not 
however really satisfactory. The thermal energy required to excite a particle in a 
box from the ground to the first excited energy state is only of the order of 
10“ 14 K but the concept of the entropy becoming a constant at ‘low tempera¬ 
ture’ was derived from experiments at temperatures near 4 K. There are there¬ 
fore many particles in excited states at even the lowest temperature accessible 
to experiment. 

The explanation of this discrepancy between the observed temperature at 
which the entropy becomes independent of temperature and that expected if the 
system was required to be in its ground state, is once again related to the fact 
that the degeneracy factor g in equation 7.99 may be very large by normal 
standards (say 10 10 ) without making a significant contribution to the entropy 
of the system (equation 6.20). In terms of the density of states function, 
equation 7.99 may be written 

S = klnD(E)dE (7.99) 

Casimir first pointed out that provided D(E)dE increases sufficiently slowly 
with energy the contribution to S 0 will be immeasurably small at experimental 
‘low temperature’ even though many states of the system are occupied. The 
third law of thermodynamics should therefore be understood in this rather more 
restricted way than the simple statement involving the true ground state of the 
system which is usually not accessible to experiment. 

The theoretical framework of statistical mechanics and thermodynamics for 
systems with a fixed number of particles in thermal equilibrium has now been 
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taken as far as is necessary for the purposes of this book. In the next two 
chapters, the original version of statistical mechanics (the kinetic theory of the 
classical perfect gas) will be discussed for both equilibrium and non-equilibrium 
states. A number of further examples of the application of the theory developed 
so far will be found in the last part of the book. 


EXERCISES 

7.1 Calculate the value of A/kT for a nucleus of spin \ with magnetic moment 

5 x 10~ 27 JT” 1 in a field of 1 T(10 kG) at a temperature of 1 K and repeat the 
calculation for an ion with magnetic moment 9 x 10 -24 JT -1 . 

7.2 Show that in the high-temperature limit the heat capacity of a two-level 
system is proportional to the inverse square of the temperature. 

7.3 Find an expression for the entropy of a system of one-dimensional simple 
harmonic oscillators as a function of temperature. 

7.4 Show that a gas of particles moving sufficiently close to the speed of light 
(c) to have the energy related to the momentum by e = cp obeys the equation 
PV = E/3. 

7.5 The temperature of the chromosphere of the sun is approximately 5 x 10 3 K. 
Find the relative number of hydrogen atoms in the energy levels with n = 1,2,3,4. 
The Rydberg constant is 1.1 x 10 7 m -1 . 

7.6 The moment of inertia of a molecule is 4.6 x 10 -48 kg m 2 . Find the relative 
populations of the J = 0,1,2, 3,4 rotational energy levels at 300 K. At what 
temperature are the J = 2 and / = 3 levels equally populated? 



PART III 

KINETIC THEORY 



8. Kinetic Theory of Gases-I 


The kinetic theory of the classical perfect gas (largely due to Maxwell and 
Boltzmann) was the first statistical theory to be introduced into physics. The 
equilibrium results of kinetic theory can now all be obtained as special cases of 
the general results of statistical mechanics obtained in chapter 7 but the detailed 
nature of the kinetic theory can give added insight into processes such as the 
pressure exerted by a gas on the wall of a container. The kinetic theory approach 
can also be extended to non-equilibrium situations and, as shown in the next 
chapter, may be used to calculate the transport coefficients (thermal conductivity, 
coefficient of viscosity, etc.) of a perfect gas. The kinetic theory of the perfect 
gas is of great importance in applied physics because the results are directly 
relevant to real gases at low pressure as in vacuum systems, the upper atmosphere 
and space research. 

The model of a gas on which the kinetic theory is based is that of a set of 
molecules moving rapidly at random throughout the whole volume of the con¬ 
taining vessel. The energy of a monatomic gas is set equal to the sum of the 
kinetic energies of the molecules. In the simplest model the molecules are 
treated as elastic spheres (this assumption is discussed in more detail in chapter 
9) which on average travel a distance (the mean free path) which is large com¬ 
pared to their diameter, before making a collision. The total volume occupied by 
the molecules is assumed to be much less than the volume of the containing 
vessel. The kinetic theory will be seen in chapter 9 to be internally consistent 
in that the mean free path in a gas at N.T.P. calculated using the measured and 
predicted transport coefficients is indeed much greater than the molecular dia¬ 
meter. 

The statistical nature of the kinetic theory should be clear from the dis¬ 
cussion of chapter 7. An individual molecule will change its velocity after each 
collision. Only the probability of the molecule being found to have a given 
velocity (or small range of velocities) can be specified, but for a gas in thermal 
equilibrium the fraction of the molecules within the given range of velocities 
remains constant. The equilibrium results of kinetic theory will be discussed in 
this chapter and the transport theory in chapter 9. 
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8.1 DISTRIBUTION FUNCTIONS 


A number of distribution functions are sometimes useful in kinetic theory 
calculations although they do not contain any information that cannot be 
found from the partition function. The distribution function for some property 
x is defined either by the relation 

fix)dx = probability of x lying in the range x to x + dx 

or alternatively 

F(x)dx = number of molecules with x in the range x to x + dx 

In the first case the integral of the distribution function over all values of x is 
equal to unity and in the second case is equal to the total number of molecules 
(N). The two distribution functions are simply related 

F(x) = N f(x) 

It is usual in kinetic theory however to write simply/(x) for either distribution 
function. The parameter x may be either a vector (for example momentum, 
velocity) or a scalar (for example energy, speed) quantity. 

The fundamental distribution function may be derived directly from the 
classical partition function of a single particle 


f(p,r)d 3 pd 3 r = 


e P € d 3 p d 3 r 
J e _/3e d 3 p d 3 r 


( 8 . 1 ) 


: probability that a molecule has momentum in the range p to 
p + dp and position in the range r to r + dr 


The limits of the integrals are taken to be plus and minus infinity for the 
momentum and the whole of the volume of the container for the position 
coordinates. The extension to plus and minus infinity of the momentum vari¬ 
able is not of course consistent with the theory of relativity but this is of no 
practical importance since the exponential term is negligible at such high 
energies. When the momentum of a particle is independent of its position the 
left-hand side of equation 8.1 may be factorised 

f(p, r ) d 3 p d 3 r = f(p) d 3 pf(r) d 3 r 


and when gravitational effects are neglected the energy of a particle is indepen¬ 
dent of its position so equation 8.1 becomes 


f(p) d 3 p = ~ 

r 


P* d 3 p 


~P e d 3 p 


( 8 . 2 ) 



Kinetic Theory of Gases -1 

The energy of a particle is related to the momentum by 


159 


e = \mv 2 = —- = -— ( p x 2 + p y 2 + p z 2 ) 

2m 2m 

The normalised velocity distribution is most easily obtained by noticing that the 
triple integral in equation 8.2 is separable in cartesian coordinates. 

f(p) d 3 p = f(Px) dp x f(p y ) d Py f(p z ) dp z 


Therefore 


f(.Px)dp x = 


_£Px 1 
e 2m d Px 


0Px 2 

2m 


i 


dp x 


The denominator is a standard definite integral (appendix III) so 

0PX 1 


r(p “ )dp - - 

p - ( 2nmkT ) 


2m 


dp x 


1 _0P_ 

e 2m d 3 p 


are the normalised momentum distribution functions in one and three 
dimensions. 

The velocity distribution function follows immediately since 
p = mv p x = mv x 

f(n x )dv x = f(p x ) dv x 


Therefore 


f(v x ) du x 

/(b) d 3 B 


(8.3) 

(8.4) 


( m ■ 

\4 mv x 2 

) e 2kT dv x 

(8.5) 

\2nkT, 

m \ 

2itkTj 

-2. mv 2 


2 e“^ d 3 u 

(8.6) 


The momentum and velocity distribution functions therefore have maximum 
values at the origin and are symmetric about the origin since there can be no 
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preferred direction in space for a system in thermal equilibrium when gravita¬ 
tional effects may be neglected. The one-dimensional velocity distribution 
function is shown in figure 8.1. The area under the curve is always equal to unity 
but the width of the distribution at half-height increases and the maximum value 
decreases as the temperature is raised. 



V x (xIO 2 m s" 1 ) 


Figure 8.1 The velocity distribution function for a perfect gas (oxygen) in one dimension 
for temperature 300 K and 900 K. The area under the curve is always equal to unity. The 
mean value of the velocity is zero. 


Two features of the velocity distribution function are of some interest in 
connection with the nature of the quantity defined as ‘heat’ in part I. It is clear 
from figure 8.1 that the average velocity (magnitude and direction) of the mole¬ 
cule relative to the walls of the containing vessel is zero. The mean square 
velocity however is not zero (section 8.2) and increases with the temperature. 
The thermal energy is therefore associated with the random motion of the 
molecules within the container but external work involves the movement of a 
piston in a given direction. The conversion of heat into work therefore involves 
a change from the random motion of many particles to the ordered motion of 
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the piston so it is not surprising that a thermal cycle cannot operate with an 
efficiency of unity. 

The second feature of the velocity distribution function which should be 
noted is that the width of the distribution increases with temperature. The 
probability of finding the molecule in a given small range of velocity therefore 
decreases as the temperature increases. An alternative way of expressing this is 
to say that our information about the velocity of the molecule has decreased as 
the temperature increased and since from equation 7.70 an increase of tempera¬ 
ture at constant volume corresponds to an increase in the entropy of the gas 
there must be a relationship between information and (the negative of) the 
entropy. This point will not be pursued further here but the connection between 
information and entropy is sometimes used as a basis for statistical mechanics 
using equation 7.30 

S = - 2P r \nP r 

r 

to define the entropy of any system whether or not it is in thermal equilibrium. 

The speed distribution function is found by integrating the velocity distribu¬ 
tion function over all directions at constant v. The integration is most simply 
carried out by using polar coordinates. 

d 3 v = v 2 du sin 6 dd d 0 

/< " )d '’ * "■ d " 

= W 

\2nkTj 

The speed distribution function is shown in figure 8.2. The most probable speed 
is not zero but may be found by differentiating equation 8.7 

/ 2kT 
y mp “ V 

m 

The energy distribution function was derived in chapter 7 but may also be 
found from equation 8.7 using 

e = 7 mv 2 de = mv du 

/(e)de ' 2 (^) Tr " de (o) 

The energy distribution function is shown in figure 8.3. The remarkable feature 
of equation 8.8 is that the energy distribution function is independent of the 


/ IT C 27T , 

sin 6 dd I d(j> 
o Jo 


mv 

2kT 


dv 


(8.7) 
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f (eV) 


Figure 8.3 The energy distribution function for a perfect gas at 300 K and 900 K. Notice 
that (unlike the other distribution functions) the energy distribution function is 
independent of the mass of a molecule of the gas and is a function only of temperature. 


particle mass. The energy distribution function for all perfect gases is the same 
at temperature T. This result goes beyond the earlier statement of the equi- 
partition theorem, that the mean energy of a particle is independent of the mass, 
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to show that the distribution of the energy is also independent of the mass. In 
principle the perfect gas temperature T could therefore be defined by measuring 
the distribution function and using equation 8.8 to find the temperature. 


8.2 MEAN VALUES 

The mean value of a quantity such as x n is defined in terms of the distribution 
function to be 


- |W(*) 

c n = A - 

j f(x)± 


dx 


(8.9) 


where as usual the integral extends over the whole range of x. The mean value of 
the velocity 


f V x 

%/ — oo 


mv x 

2kT 


dt> Y 


Uv = 


= 0 


( 8 . 10 ) 


j: 


mvx 

e" 2kT dv x 


as may be seen immediately since the numerator is the product of an odd 
function with an even function. Similarly the other components of the velocity 
are zero, and in general 


v n = 0 (n odd) 

The mean square value of the velocity is found from 


( 8 . 11 ) 


f * 


mv x 

2 ~2 kT 


dv Y 


V x 2 = 


j: 


mv x 

e _ Tfcr" du 


kT 

m 


v 2 = v x 2 + v y 2 + v z 2 = 


3 kT 


m 


( 8 . 12 ) 


(8.13) 


The mean square width of the velocity distribution may be found from the 
general result 

(v~v) 2 = v 2 - (y) 2 = v 2 = 


m 


(8.14) 
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The width of the distribution for a given gas is therefore proportional to the 
square root of the absolute temperature. 

The root mean square velocity of a molecule is of the same order as the speed 
of sound in the gas (« 500 ms -1 at room temperature) since 

/ yP / ykT 

c* = V — = V — (8.15) 

p m 

where y is the ratio Cp/Cy equal to 5/3 for a monatomic gas. 

The mean speed is found using equation 8.7 

mv 2 

v 3 e 2kT dv 


mv 2 

f °° v 2 e 2kT dv 
J 0 

The integral now runs only from zero to infinity since speed is a scalar quantity. 
The mean square speed is 



8 kT 
nm 


(8.16) 


£ 


m v 
2 kT 


dv 


f 

Jo 


v 2 e 


mv 
2 kT 


3 kT 
m 


dv 


(8.17) 


in agreement with equation 8.13. 

The mean translational energy of a particle may be found from equation 8.8 
or from the equation 

e =\mv 2 =\kT (8.18) 

a result that was obtained previously as an example of the equipartition theorem. 

The important conclusion from these results is that the value found for any 
physical quantity (the coefficient of viscosity for example) will depend upon the 
way in which average values are taken. An incorrect average may lead to a 
relatively small numerical error, the speed values found so far for example are 
in the ratio 


y/v 2 :v:v mp = 1.22:1.13:1 (8.19) 

On the other hand, it may lead to complete nonsense as when the mean and 
mean square velocities are confused. An average value must be taken as late as 
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possible in the calculation as will become apparent from the calculation in 
section 8.4 of the pressure exerted by a perfect gas. 


8.3 DOPPLER BROADENING OF SPECTRAL LINES 

The radiation emitted by the molecules in a discharge tube is observed to have a 
finite width due to the random motion of the molecules. The motion of the 
molecules along the line of sight leads to Doppler broadening of the spectral lines. 
The transverse motion does not lead to Doppler broadening provided (as is always 
the case in the laboratory) that the velocity of the molecules is much less than 
the speed of light. The width of the spectral line may therefore be found from 
the one-dimensional velocity distribution function. 

The intensity of the radiation emitted by molecules with velocity v x to 
v x + dv x may be written 

I(v x )dv x = aNf(v x )dv x (8.20) 

where a is the radiation emitted per molecule. The wavelength of the light 
emitted by a molecule moving away from the observer with velocity v x is given 
by 



where X 0 is the wavelength of the light emitted by a molecule at rest. The 
intensity of the radiation given by equation 8.20 can therefore be related to the 
distribution as a function of wavelength since 



The intensity is a maximum at the unshifted wavelength (X 0 ) and has a Gaussian 
shape. The line may be normalised to the intensity at the centre of the line 


/ (X) - / (X 0 ) exp 


me 2 

2kT 



2 


( 8 . 21 ) 



166 


Thermal Physics 


The wavelengths for half the maximum intensity are given by 


In 2 = 


me 


2kT 



2 


or 



y 


2kT In 2 
m 


The full width of the line at half intensity is therefore 


_ 2A 0 / 2kT In 2 

c v m 


( 8 . 22 ) 


Since the mass of a molecule is fixed for a given gas the only way to reduce 
the Doppler broadening of a spectral line is to lower the temperature of the gas. 
An alternative approach is to allow the molecules to form a beam and to observe 
the radiation from the beam normal to the line of flight. The Doppler broaden¬ 
ing is then unimportant but the intensity of the radiation is much lower than in 
a bulk sample of the gas. 


8.4 THE PASSAGE OF MOLECULES ACROSS A PLANE SURFACE 


In this section an important general expression for the number of molecules 
crossing a plane area in the gas will be established, and used to derive the 
velocity distribution function in an atomic beam and the perfect gas law. 

The flow of molecules from one side through a plane surface of area S in the 
bulk of a gas in thermal equilibrium must be exactly balanced by the flow from 
the other side since there can be no preferred direction in the gas. The net flow 
is therefore zero and in the following discussion it is the flow of particles from 
one side only which is under consideration. 

One special group of molecules will first be considered and the total number 
of molecules crossing the plane will be obtained at the end of the calculations. 
The number of molecules per unit volume of the gas with speed in the range v to 
v + dv is (from equation 8.7) 


47 m 



e 


mv 2 


2kT 


du 


where n is the total number of molecules per unit volume. 

Since the gas is homogeneous, the number of molecules moving in a direction 
defined by an infinitesimal solid angle dco is simply dco/47r. Consider the situa¬ 
tion shown in figure 8.4 where the element of solid angle makes an angle 6 with 
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Figure 8.4 A molecule with speed v travelling at an angle 6 to the normal to the surface S 
will pass through the surface after a time dr if it was originally within the cylinder of 
base S and slant height v dr. 


the normal to the surface S. A molecule with speed v will cross the surface S 
after a time d t if it is within a distance v d t of the surface. The total number of 
molecules crossing the plane in time d^ with speed v is equal to the number con¬ 
tained in a cylinder of base S and slant height v d t. The volume of the cylinder 
is therefore vS cos 9 d t. 

The number of molecules crossing S in time dt with speed in the range v to 
v + du in solid angle dc o making an angle 6 with the normal to S is 

4 *' , (ds ; ) ! e ‘ tT dv (£) (vS °° se i0 

The element of solid angle may be written in polar coordinates 
dco = sin 9 d0 d0 

so the number of molecules crossing unit area in unit time may be written 

— mu 2 

v 3 e 2kT du sin 9 cos 9 d 9 d0 (8.23) 

This is the general result from which the total number of molecules crossing the 
plane from one side and the pressure exerted by the gas may be evaluated. It is 
also the distribution of the molecules which emerge from a sufficiently small 
hole in the containing wall (an atomic beam, section 9.2). 

The total number of molecules crossing unit area in unit time from one side 


d R = n 


m 


2nkT 
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is found by integrating equation 8.23 over all speeds and all allowed angles. 

( \ 2 mx> 1 1 

m \ 2 C °° 3 ~ 2kT P 2 p2n 

Uti) J.^ e d»J o *.9co.«d«/ o d* (8.24) 


The integration over 0 is restricted to the range 0 to n/2 because molecules 
making an angle greater than n/2 with the normal to the plane are moving in the 
opposite direction from that under consideration. The net flux of molecules is 
found by allowing 0 to run from 0 to 7r and is then seen to be correctly zero. 

The first integral in equation 8.24 is, from equation 8.7, equal to the mean 
speed of the molecules (apart from a factor of 47r) and the angular integrals are 
equal to \ and 2n respectively. 

The total number of molecules crossing unit area from one side in unit time is 
R = \ nv (8.25) 


The average value v was correctly found at the end of the present calculation. 
The more elementary calculation which considers 1/6 of the molecules to be 
moving in one direction along any axis in the gas would obviously lead to a result 
nv/6 but the correct average for v is undetermined. 

The importance of performing averages at the end of a calculation is also 
apparent from the calculation of the pressure exerted by a gas. In a dilute gas the 
number of molecules striking unit area of the wall of the container in unit time 
is identical to the number of molecules passing from one side through an 
imaginary surface in the interior of the gas. The momentum delivered normal to 
unit area of the wall in unit time by molecules with speed v moving at an angle 
0 to the normal to the wall is then, from equation 8.24 

t 3 mv 2 

n /-\ y 3 e 2 k T du sin 0 cos 0 d0 d0 

\2nkT) J 

If the molecules remained attached to the wall after striking it the pressure 
exerted by the gas on the wall would simply be equal to 


P = 




sin d cos 6 dd 



(8.26) 


nmv 2 nkT 
6 ~ 2 


(8.27) 


using the result of equation 8.13 for v 2 . 

When the molecules collide with the wall of the container and return to the 
bulk of the gas, the total pressure exerted must be twice that given by equation 
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8.27 regardless of the nature of the collisions, so the usual result 

P = \nmv 2 = nkT (8.28) 

is obtained. A molecule could either make a specular reflection (angle of incidence 
equal to angle of reflection) with the wall or return to the gas at an angle inde¬ 
pendent of the angle of incidence. It was first shown by Knudsen that all (or 
nearly all) of the molecules return to the gas with a distribution identical to that 
with which they arrive (equation 8.23). This point is considered in more detail 
in section 9.4.3 but the pressure exerted by the molecules is independent of the 
nature of the surface provided that the molecules return to the bulk of the gas. 

The importance of performing averages at the end of a calculation is apparent 
from equation 8.26. An attempt to use the exact result for the tot^l number of 
molecules passing through the plane (equation 8.25) and some kind of average 
value for the momentum would not have been successful. (The roughest calcula¬ 
tion involving 1 /6 of the molecules moving in a given direction which was seen 
to be incorrect for the total number of molecules crossing the plane does how¬ 
ever lead to equation 8.28 due to the coincidence that the integral over the 
angle 0 in equation 8.26 is equal to 1/3.) 

The form of equation 8.25 is extremely important for surface physics. The 
clean surface of the material to be studied is normally prepared in a vacuum 
chamber with residual pressure P. The clean surface will be contaminated by 
molecules coming from the vacuum space at a rate given by equation 8.25. 
Assuming that all the molecules remain on the surface and have diameter d the 
time r to completely cover the surface is given by 

(H (t ! ) r - 1 

r may be related to the pressure using the gas law in the form P = nkT and 
equation 8.16 may be substituted foru to give 


16 / nm kT 

7 ~ nd 2 V 8 kT P 


(8.29) 


This time is roughly 30 s (exercise 8.3) for a pressure of 10 -7 torr at room 
temperature (1 torr = pressure of 1 mm mercury = 333 N m -2 ). Since a pressure 
of 10 -7 torr is quite a good laboratory vacuum (an ultra-high vacuum is usually 
considered to be better than 10~ 8 torr) it is clear that surface contamination 
can be a serious problem in surface physics and has in the past led to many 
spurious results. 

A further use of equation 8.25 is in the measurement of small vapour pres¬ 
sures. The number of molecules of vapour striking the surface would be given by 
equation 8.25 and this must also be the number of molecules of the solid which 
leave the surface to preserve the equilibrium. If all the molecules leaving the 
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surface are collected then n can be determined and hence the vapour pressure 
(using equation 8.28). 


8.5 EFFUSION 

When two gas containers are joined together by a pipe of large diameter the gas 
comes to an equilibrium condition in which the pressure is constant throughout 
the system. The gas flow between the containers while the system comes to 
equilibrium is essentially a hydrodynamic problem. If the temperature of one 
of the containers is now changed a further mass flow will occur until the pres¬ 
sures are again equalised. Two containers joined by a pipe of small diameter or a 
porous membrane however behave quite differently (figure 8.5). 


P, 7*a 


PJs 





Pa .7a 

Pq » 7"b 




Figure 8.5 Two containers joined by a pipe of large diameter always maintain the same 
pressure. Two containers joined by a pipe of small diameter or by a porous plug are only 
at the same pressure if their temperatures are equal. 


When the aperture joining the two containers is sufficiently small the mole¬ 
cules in the region of the aperture may make a collision and pass out of the 
container without affecting the equilibrium of the rest of the gas. A small 
aperture is therefore one which has a dimension much less than the average 
distance a molecule travels between collisions (the mean free path, section 9.1). 
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The number of molecules passing through the aperture in this case will be given 
by equation 8.25 provided that the thickness of the aperture is sufficiently 
small for the integrations in equation 8.24 to be applicable. 

The number of molecules passing from one container to another will, in the 
steady state, be given by 

\ n AVA = 4 n hVB 

Therefore 

PaVa __ PbVb 
Ta ~ T b 


The pressure in the system is therefore constant when both containers are at the 
same temperature but in general (substituting forTJ using equation 8.16) 


Pa _ Pb 
s/Ta s/T b 


(8.30) 


The process of gas transport through small apertures is called effusion and the 
pressure difference produced by a temperature gradient is called a thermo- 
molecular pressure difference. The importance of equation 8.30 is that in low- 
temperature physics the temperature of the system is often measured in terms of 
the pressure of helium vapour. The mean free path in a gas at low pressure 
(section 9.4.3) may become greater than the diameter of the coupling tube to the 
manometer and the measured pressure for a system at 1 K and a manometer at 
300 K for example would then be in error by a factor of \/300. The possibility 
of a thermomolecular pressure difference must therefore always be considered 
in systems which contain temperature gradients and low-pressure regions. 

Knudsen first demonstrated that the thermomolecular pressure difference 
could be used to produce an absolute manometer for the measurement of 
pressures in the range 10 -2 - 10“ 7 torr. The manometer is shown schematically 
in figure 8.6. The vane A is suspended from a quartz fibre so that the separation 
between A and the block B is much less than the mean free path in the gas. 

When the temperature of the block B is raised by a heater from temperature T 
to temperature T x a thermomolecular pressure difference will be set up and the 
vane will rotate. If the pressure in the system at temperature T is P and the 
pressure between the block and the vane is P x 


p i = / T ' + T 

P V 2T 


(8.31) 


where the gas temperature in the gap is assumed to be the average of T and 7\. 
When ( T x - T ) is much less than T the equation may be simplified using the 
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Figure 8.6 Plan of Knudsen absolute manometer. The block B (surrounded by guard ring G) 
is heated causing the vane A (suspended from a fibre C) to rotate. 


binomial theorem 



P \ TT ) AT 

The net force per unit area on the vane A is 



(8.32) 


The force constant of the quartz fibre may be found in a separate experiment 
and then only the temperature of the system and the block B is required to find 
the pressure in the system. An attractive feature of the Knudsen manometer is 
that the measured pressure is independent of the gas in the system. The mano¬ 
meter is said to be ‘absolute’ because it measures pressure directly in terms of 
the mechanical properties of the quartz fibre (which can be determined in a 
separate experiment) rather than by comparing two pressures as is commonly 
the case with, for example, the Rrani gauge (section 9.4.3). The lower limit to 
the useful pressure range of the Knudsen manometer is about 10” 7 torr. At 
lower pressures the torque due to the thermomolecular pressure difference 
becomes so small that it is difficult to measure. At the upper limit (10 -2 torr) 
the separation between the plates cannot be made much less than the mean free 
path in the gas and equation 8.30 no longer applies. 

One further important use of equation 8.25 will be discussed in this section. 
When two different gases are enclosed in a porous container the initial effusion 
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rates will depend upon both the concentration and the mass of the molecules. If 
the molecules are of type A and type B and there are initially n A molecules of 
A and n B molecules of B per unit volume then the ratio of the molecules in the 
container is n A /n B hut after effusion the ratio outside the container is n A v A /n B v B 
or substituting for the mean speed from equation 8.16 

R A = ”4 /«B (8 .33) 

R b n B v m A 

The gas inside the container is now richer in the heavier component than it was 
before the effusion process started. The separation of the two gases is of course 
only partially due to a single effusion process but the enriched gas may be used 
in a cascade process to obtain complete separation. This method was used for 
the first separation of the isotopes of uranium with the uranium in the form of 
the gas UF 6 . 


EXERCISES 

8.1 Find expressions for v 3 andu 3 . 

8.2 The mass of an atom of a monatomic gas is 1.4 x 10 -25 kg. Show that the 
fractional width of the Doppler broadened spectral line is 2 x 10 -6 when the 
gas is at a temperature of 650 K. 

8.3 A clean metal surface is placed in a vacuum which has an oxygen pressure of 
10~ 7 torr at room temperature. Estimate the time taken to cover the surface 
with oxygen if the diameter of an oxygen molecule is 0.23 nm and the system is 
at room temperature. How long would it take to half cover the surface if the 
pressure was reduced to 10 -10 torr? 

8.4 Show that the mean square velocity of a molecule emerging from a small 
hole in a gas container at temperature T is equal to 4kT/m. Find the mean 
energy of the molecule and comment on the difference between this and the 
mean energy of a molecule inside the container. 

8.5 Find an equation for the ratio of the concentrations of the two molecular 
species in a porous container during an effusion process if the gas that has 
effused is pumped away from the container. Hence show that no complete 
separation of the two species is possible in a single operation. 

8.6 Find an expression for the number of electrons emitted from unit area of 
the surface of a metal in unit time if all electrons which strike the surface with 
velocity greater than v 0 normal to the surface escape and the electrons may be 
treated as a perfect gas. 



9. Kinetic Theory of Gases—II 


9.1 THE MEAN FREE PATH 

The concept of an average distance that a molecule travels between collisions (a 
mean free path) is not required in the rigorous transport theory of the classical 
perfect gas but remains a useful concept at the more elementary level adopted in 
this chapter. The mean free path is well defined for a gas of elastic spheres but 
requires more careful consideration when the more realistic model of a molecule 
as a scattering potential is used. 

Consider first an elastic sphere of radius b travelling with speed v through a 
random arrangement of stationary spheres of radius a . When the moving sphere 
makes a collision with one of the stationary spheres the centres of the two 
spheres are at a distance (a + b ) (figure 9.1). The direction of the motion of the 
moving sphere will change after each collision but in a time t (long compared to 
the time between collisions) the total distance travelled is simply vt. The 
volume swept out by the moving sphere in time t is mt(a + b ) 2 . If the average 



\ / 


Figure 9.1 A sphere of radius b sweeps out a volume with cross section equal to n{a + b ) 2 
in a. distribution of spheres of radius a. 
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number of spheres of type A per unit volume is n 

. „ f distance travelled in time t 

Ai = mean free path =- 

number of collisions in time t 


vt __ 1 

m{a + b) 2 vt mrd^y 2 


(9.1) 


where 

<4v = y (2 a + 2b) 

is the average diameter of the two types of sphere. The quantity 

o = 7r<2 av 2 (9.2) 

is called the scattering cross-section. 

The mean free path of a particle in a real gas of moving molecules is still 
given by equation 9.1 if the particle of interest is moving much faster than the 
mean speed of the gas molecules. An electron in thermal equilibrium at tempera¬ 
ture T will be moving much faster than a molecule of mass m since from equa¬ 
tion 8.13 


\m Q v e 2 - \mv 2 = \ kT 

and m>m e . The diameter of an electron is also found to be negligible relative 
to that of a molecule so equation 9.1 becomes 

Xe = (wra 2 ) -1 (9.3) 


The calculation of the mean of all the free paths in a gas of elastic spheres 
moving with a Maxwellian distribution of velocities is straightforward but the 
proof is tedious and will be omitted here. It is possible in fact to define more 
than one type of average in this case but the most commonly used (sometimes 
called the Maxwell free path) is 


total distance travelled by the molecules in time t 
2 x number of collisions in time t 


Xi_ 

V2 


(9.4) 

(9.5) 


The factor of 2 in equation 9.4 is required because each collision terminates two 
free paths in the gas. 

The mean free path of a molecule in a gas is clearly a function of its speed. 

A stationary molecule has a mean free path of zero, a very fast molecule a mean 
free path given by \ x and a typical molecule a mean free path of A. The 
Maxwell mean free path may be written in a number of equivalent forms using 
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the gas laws. In a gas containing only one type of molecule of diameter d, for 
example 


X = 


1 

\J2 md 2 


kT 

\/2 nd 2 p 


(9.6) 


The mean free path of a molecule in a given gas therefore varies such that 

X &n~ l density -1 (9.7) 


The diameter of a gas molecule (found from the measurement of the coefficient 
of viscosity (section 9.4.1) is typically 0.3 nm so at N.T.P. the Maxwell mean 
free path is about 10 2 nm. The mean free path is much greater than a molecular 
diameter but is also much less than the dimensions of a normal container. The 
molecules therefore spend most of their time in free flight but normally end a 
free path by colliding with another molecule rather than the wall of the con¬ 
tainer. When the mean free path becomes comparable with the dimensions of 
the system the interaction of the gas molecules with the surface of the container 
has to be considered (section 9.4.3). The molecules collide on average after a 
time roughly equal to that required to travel a mean free path at the average 
speed. At N.T.P. this time is about 10” 9 s. In chapter 1 it was seen that from 
computer studies it appears that a gas with a non-equilibrium distribution of 
velocities approaches equilibrium after two collisions per molecule. An equili¬ 
brium velocity distribution is therefore established at N.T.P. in a time short 
compared to most experimental measuring times but at low pressure and 
temperature the relaxation time will increase. 

At first sight it appears to be quite unrealistic to attempt to represent the 
molecules of a gas by a set of hard elastic spheres. The theory of the scattering 
of one atomic particle by another forms an important branch of quantum 
theory and the general scattering problem can certainly not be reduced to the 
single energy-independent cross-section found for elastic spheres. The subtleties 
associated with the detailed quantum theory of atomic scattering however turn 
out to be of most importance when the incident particle is only slightly deflected 
from its original path (low-angle scattering). The mean free path of electrons 
which are scattered out of a narrow beam is therefore energy dependent. In trans¬ 
port theory however the molecules which make the most important contribution 
to the coefficient of viscosity or the thermal conductivity are found to be those 
which are scattered through large angles and an energy-independent cross-section 
is found to be a much better approximation for large-angle scattering than for 
small-angle scattering. The cross-sections found from atomic beam measure¬ 
ments would therefore be expected to be greater than those found from measure¬ 
ments of the transport coefficients. 

The effective cross-sections for gas molecules, calculated using the measured 
transport coefficients and the kinetic theory discussed in section 9.4.1 are found 
to be temperature dependent showing that the elastic hard sphere model cannot 
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be exact but the dimensions found for the molecules are of the right order of 
magnitude. 


9.2 ATOMIC BEAMS 


The atoms of a gas may be allowed to escape from an oven at temperature T 
through an aperture which is sufficiently small not to disturb the equilibrium in 
the bulk of this gas (section 8.5). If the space outside the oven is evacuated to a 
pressure of about 10“ 7 torr the molecules will travel a mean free path of about 
1 m before colliding with a molecule of the residual gas in the vacuum. The 
velocity distribution in the beam will be given by equation 8.23 provided that 
the thickness of the defining slits of the aperture in the oven is sufficiently 
small. The temperature to be used in equation 8.23 is that of the oven from 
which the beam emerged. The beam cannot be said to have a temperature since 
it is not in thermal equilibrium but has a ‘memory’ of the oven temperature 
until the molecules collide with the residual gas molecules in the vacuum. An 
atomic beam apparatus may be used to verify the Maxwell velocity distribution 
in the form given by equation 8.23 (section 9.3) but in a normal apparatus the 
beam emerges from the oven not through a thin slit but from a long pipe. This 
increases the flux of molecules in the forward direction but the pressure in the 
oven must be lowered until the mean free path in the gas is of the order of the 
length of the pipe. 

The attenuation of an atomic beam may be expressed in terms of the mean 
free path in the vacuum space. The simplest case to consider is that of a beam of 
particles of speed v each of which is thrown out of the beam after making one 
collision with a molecule of the residual gas in the vacuum space. If the number 
of particles left in the beam after traversing a distance x in the vacuum is N(x) 
the number scattered in a length dx will be proportional to N(x) and may be 
written 


dN - - aN(x) dx 


(9.8) 


where a is independent of x and is made into a positive quantity by the negative 
sign on the right-hand side of the equation. The solution of equation 9.8 is 

N(x) = N(0)e- ax (9.9) 

where N( 0) is the initial number of molecules in the beam. The probability of a 
particle travelling for a distance x without making a collision is 


P(x) = e~ a * 
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The mean free path of a particle in the beam is given by 


K 


j; 

s: 


e ax dx 


e ax 6x 


1 

a 


so the final form for the attenuation of the beam may be written 

N(x ) = N{ 0) e~ ^ (9.10) 

The scattering cross-section may therefore be found by measuring the attenua¬ 
tion of a beam as a function of distance. If the particles in the beam are passed 
through a velocity selector before they reach the vacuum space, the cross-sec¬ 
tion may be studied as a function of energy. The early studies of electron - mole¬ 
cule scattering using this method first showed that the scattering cross-section 
was a function of the electron energy rather than a constant as suggested by 
equation 9.1. 


9.3 THE VERIFICATION OF THE MAXWELL VELOCITY DISTRIBUTION 

An atomic beam apparatus combined with a velocity selector [figure 9.2(a)] may 
be used to verify the Maxwell velocity distribution in the form appropriate for 
the molecules effusing from an oven (equation 8.23) 

_ mt^ 

f(v)dv=Av 3 e 2kT di> (9.11) 

where A is a constant for a given gas and temperature. The velocity selector 
could be of the notched-wheel type used in the classic experiments of Fizeau 
on the velocity of light but a more effective velocity selector is formed by 
cutting a series of slots in a cylinder, as shown in the diagram. When the cylinder 
is rotating with angular velocity co a molecule with speed v will travel along the 
slot without changing its position relative to the sides of the slot if 

c oL 

v =- 

00 

where L is the length of the cylinder and 0 O is defined in figure 9.2(b). Molecules 
with a range of speeds will pass through the slots at a given value of co since the 
slots have finite width. It may be shown that the range of speeds is proportional 
to the central speed v 0 and may therefore be written yv 0 where 7 is a constant. 
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Figure 9.2 (a) A schematic diagram of an atomic beam apparatus. The vapour leaves the 
oven (O) and passes into a vacuum space containing a velocity selector (V) and a 
detector (D). (b) Details of velocity selector. 


The intensity of the particles incident on the detector is therefore given by 
equation 9.11 multiplied by the factor yv 0 or 

_ mv o 2 

I (Po) = constant y 0 4 e 2kT (9.12) 

The equation may be written in a universal form (independent of temperature) 
in terms of the velocity ratio V defined by 


V = 




predicted velocity at maximum intensity 


when the intensity becomes 

7(F) = 87 F 4 e~ 2V * (9.13) 

The agreement between this equation and the experimental result for potassium 
vapour is illustrated in figure 9.3. The best agreement is found when the defining 
slits of the aperture in the oven are as thin as possible, and the over-all agreement 
is obviously good but the experimental results indicate that fewer slow molecules 
passed through the system than would be expected from equation 9.11 unless 
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Figure 9.3 The measured velocity distribution for potassium vapour. The Maxwell distribu¬ 
tion is shown by the full line. The best agreement with theory was found in experiments 
using thin oven slits and low vapour pressures (shown by circles). (After R. C. Miller and 
P. Kusch,i%ys. Rev. 99 (1955) p. 1314.) 


the vapour pressure in the oven is as low as possible. This result is understandable 
since very slow molecules spend a comparatively long time in the vicinity of the 
oven slits and may therefore be scattered out of the beam before reaching the 
velocity selector. 

The agreement between the best experimental results (obtained for thin oven 
slits and a low vapour pressure in the oven) is extremely striking when it is con¬ 
sidered that there are no adjustable parameters at all in equation 9.13. The 
theoretical velocity at maximum intensity is given from equation 9.12 by 


v 


max 



kT 

m 


and may be calculated from the oven temperature T. The quantity V is therefore 
defined and the quantity y is determined from the geometry of the velocity 
selector. 

The Maxwell velocity distribution may be considered to be well verified by 
both atomic beam experiments and also more indirectly by experiments such as 
the measurement of the Doppler broadening of spectral lines (section 8.3). 


9.4 TRANSPORT PROPERTIES OF A PERFECT GAS 

All the processes discussed so far in this book have related to systems which 
were either in true thermal equilibrium or (as in the case of atomic beams) could 
be related to a system which was effectively in thermal equilibrium. The basic 
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concept of thermodynamics and equilibrium statistical mechanics has been seen 
to be that any isolated system will change from an arbitrary initial state to a 
final equilibrium state whose properties are independent of time. When gravita¬ 
tional effects are neglected a system in thermodynamic equilibrium has no 
preferred direction in space. The number of molecules crossing unit area of a 
surface in unit time was seen to be riv/4 (equation 8.25) if transport from only 
one side was considered, but the net flux of molecules was zero. 

A more complicated situation arises when the system is not isolated but is in 
contact with reservoirs which drive the system from the equilibrium state. For 
example a pressure gradient along a tube will lead to a net flux of particles to¬ 
wards the low-pressure reservoir, a temperature gradient will lead to a net flux 
of energy and possibly also a net flux of particles and so on. An isolated system 
comes to thermal equilibrium in some characteristic time (the relaxation time) 
after which its properties are independent of time. Similarly it is found that the 
properties of a system which has a temperature gradient maintained across it 
also become time independent after some characteristic time. The system is then 
said to be in a steady state. 

The steady-state properties of a perfect gas will be considered in the next two 
sections but it must be emphasised that the theory of the steady state is a vastly 
more difficult problem than that of the equilibrium state. The properties of any 
system in an equilibrium state can be found in principle using the canonical or 
grand canonical partition function. The methods used for the calculation of the 
transport properties of the perfect gas in this book cannot however be extended 
to systems of strongly interacting particles. In recent years it has been shown by 
more advanced methods that the transport properties of any system at given 
temperature may be expressed in terms of the fluctuations of the system about 
its equilibrium mean values. In this sense it is possible to write expressions for 
the transport properties (Kubo formulae) which are as general as the partition 
function results for equilibrium properties, but the actual evaluation of, say, the 
thermal conductivity of a metal has not been made any easier by the general 
result. 

The transport properties of a gas depend upon the ratio of the mean free path 
in the gas (X) to the smallest dimensions of the container (a) and the molecular 
diameter (d). Four separate regions may be identified. 

Region I a > X d (dense gas) 

The theory of transport processes in dense gases and in liquids is an active 
branch of current research but will not be considered in this book. 

Region II a>X>d (normal transport properties) 

A gas at N.T.P. has been seen in section 9.1 to satisfy these conditions. The over¬ 
whelming majority of the molecular collisions are between molecules. The colli¬ 
sions between the molecules and the surface of the container may safely be 
ignored when calculating the transport properties. The measured value of the 
transport coefficients in this region will be independent of the size of the 
container. 



182 


Thermal Physics 


Region III a^,\>d (transition region) 

The collisions of the molecules with the surface of the container are now of 
comparable importance to intermolecular collisions. The value of the coefficient 
of viscosity in this region [measured by the rate of flow of the gas along a tube 
(equation 9.24)] now appears to depend upon the ratio a/X. The theory of 
transport processes in this transition region will be briefly discussed after the 
results for region II have been obtained. 

Region IV X>a>d (free molecular region) 

The molecules now make more collisions with the container than with each 
other. The simplest approximate treatment is now to ignore intermolecular 
collisions completely. Region IV is of great importance for vacuum physics since 
the mean free path in a gas is « 0.05 m at a pressure of 10“ 3 torr and is inversely 
proportional to the pressure at constant temperature. A good modern vacuum 
system operates at pressures in the range 10 -8 to 10 -14 torr where the condi¬ 
tions for region IV are obviously well satisfied. Similarly in outerspace (where 
the effective pressure is only about 10 -14 torr) the mean free path is much 
greater than the dimensions of a space vehicle. 

9.4.1 Normal transport properties (region II) 

The transport properties in this region are dominated by intermolecular colli¬ 
sions so collisions with the surface of the container will not be considered in this 
section. In an experiment to measure the thermal conductivity of a gas, a con¬ 
stant temperature gradient is maintained in the gas (figure 9.4). The experiment 
must be arranged so that no heat is transferred by mass flow (convection). The 
coefficient of thermal conductivity is defined by the equation 

Q = - kA dT - (9.14) 
dx 

where Q is the energy transferred per unit time in the direction x normal to the 
surface of area A (figure 9.4). 

In thermal equilibrium there is no nett flux of particles through the surface 
A . The basio concept of elementary transport theory is that at the microscopic 
level the temperature gradient (or any other driving mechanism) causes only a 
small perturbation of the system from the equilibrium state. The temperature 
gradient in an experiment might for example be « 10 2 K m” 1 . The mean free 
path in a gas at N.T.P. is about 10" 7 m. The ‘temperature’ change across a mean 
free path is therefore about 10 -5 K which is very much less than the tempera¬ 
ture of the gas. The transport coefficients defined by equations of the type 9.14 
are therefore assumed to be independent of the magnitude of the driving gra¬ 
dient. It is possible in certain systems (very pure semiconductors at low tempera¬ 
ture for example) to produce sufficiently large gradients to make this assumption 
incorrect but it may be safely employed for gases. 
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Figure 9.4 In a thermal conductivity experiment a temperature gradient is maintained across 
the gas by external thermal reservoirs. The experiment must be arranged to prevent heat 
transport by convection. The molecules are assumed to make their last collision before 
crossing the reference plane at a distance equal to the mean free path in the gas and to 
take up the temperature (energy) of the molecules in this layer of gas. 


As has already been stressed the rigorous theory of transport in a perfect gas 
is mathematically difficult. An outline of the rigorous theory will be given in 
section 9.5 but in the present section a very qualitative argument will be used to 
find expressions which are correct (apart from a numerical factor) for a gas of 
elastic spheres. The theory of polyatomic gases and gas mixtures will not be 
considered. 

The elementary calculation of the quantity Q in equation 9.14 may be made 
on the basis of two assumptions: (a) the number of molecules crossing unit area 
in unit time is ri&l4 as for a gas in thermal equilibrium and (b) a molecule which 
crosses the reference plane at x 0 makes on average a last collision a distance 
equal to the mean free path (X) from the reference plane and takes up the tem¬ 
perature characteristic of the planes (x + X) or (x - X) (figure 9.4). The first 
assumption should immediately make the calculation suspect, since it involves 
using the integrated quantity rw/4 rather than beginning by considering a group 
of molecules with speed in the range v to v + dv. 

The energy transport across the area A in unit time may now be written 

Q = \nvA [e (x + X) - F (x - X)] 

where F(x) is the mean energy of a molecule at x. Since the thermal gradient 
over a mean free path has been seen to be very small, the energy terms may be 
expanded and only the first two terms in the series retained. 

Q = \nvA F (x) + X — - F (x) + X —1 
L dx dX J 

i 

= -z-nvAX — 
dx 
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The mean energy of a molecule of a monatomic gas is 


e = 2 kT 
so 

de _ de d T _ d T 
dx d T dx V dx 


i — a i 

Q = jnvAX c v — 
dx 

and by comparison with equation 9.14 

k = ^nvX c v (9.15) 

where c v is the specific heat per molecule at constant volume. The factor of \ 
cannot of course be taken seriously after such a careless method of taking 
average values. A rather more careful calculation leads to the standard result 
(frequently used in solid-state physics) 

k = jnvX c v (9.16) 

but this too is incorrect. The rigorous result for a gas of hard spheres is 

k = l.23nvXc v (9.17) 

All three results however have the same functional form. At constant tempera¬ 
ture v is constant for a given gas and the product nX is always a constant for a 
gas of hard spheres (equation 9.7) so k is independent of n (gas density) at a 
given temperature. This surprising result (which will also be seen to be true for 
the coefficient of viscosity) is in excellent agreement with experiment for gases 
in region II and was one of the first successes of the kinetic theory of gases. 

At constant density the thermal conductivity (according to equation 9.17) is 
proportional to t; and therefore to the square root of the temperature. This pre¬ 
diction agrees less well with experiment, a law of the form 

K*T n (constant density) (9.18) 

with n about 0.7 being observed for monatomic gases. This result is under¬ 
standable if the effective diameter of a molecule decreases with temperature. 

The molecules move faster at higher temperature and can therefore come closer 
together when making a collision. The 0.7-power law can be derived for various 
models of a molecule as a field of force. The simplest model involves only a 
repulsive force between the molecules of the form \jr n where n is about 11 for 
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hydrogen and 15 for helium. The elastic sphere model would give a value of n 
equal to infinity. Many more elaborate laws of force have been used to improve 
the agreement between theory and experiment. 

These calculations were of great importance in the early days of kinetic 
theory since they provided the first information about molecular sizes using, 
say, equation 9.17 for the coefficient of thermal conductivity and equation 9.6 
to relate the mean free path to the molecular diameter. Molecular scattering now 
more naturally forms a part of quantum mechanics and will not be discussed 
further in this book. 

A simple calculation will now be given for the coefficient of viscosity of a gas. 
The definition of the coefficient of viscosity may be given in terms of figure 9.5. 

z - 


U x 


x 


U x =0 

v////////. 

Figure 9.5 When the velocity of mass flow in a gas (U x ) depends upon the distance (z) 
from a reference plane normal to the flow a force is exerted on each layer of gas by its 
neighbouring layers. The force per unit area is proportional to the gradient ( dU x /bz ) if 
U x is much less than the mean thermal velocity of the molecules. 


The layer of gas next to the wall is assumed to be at rest and therefore exerts a 
force tangential to the surface on the next layer of gas which is moving. Assum¬ 
ing that the velocity of mass flow of the gas (U x ) is much less than the mean 
thermal speed of the gas molecules, the tangential force between the layers will 
be proportional to the velocity gradient. The force per unit area may therefore 
be written 


F = 


a Ux 

dz 


(9.19) 


where r? is a constant called the coefficient of viscosity. 

The calculation of the coefficient of viscosity is now carried out as for the 
thermal conductivity but it is now the net flux of momentum across the plane 
which is of interest, since by Newton’s third law (F = dp/dt) this is equal to the 
net force exerted. Therefore 


F = j-nvm [U x (z + X) - U x (z - X)] 


[' 


= \nvm | U x (z) + X — - U x {z) + X — 
dz dz 
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r? = \nmv\ = \pv\ (9.20) 

where m is the mass of one molecule and p the density of the gas. A slightly 
better calculation again leads to a value of 1/3 rather than 1/2 for the numerical 
factor. The rigorous result for hard spheres is 

r? = 0.499 nmvX (9.21) 

The coefficient of viscosity of a perfect gas in region II is therefore predicted to 
be independent of the density of the gas at constant temperature. This result 
was apparently first obtained experimentally by Boyle in 1660 (using the 
damping of the oscillations of a pendulum as a measure of the coefficient of 
viscosity) but was forgotten until Maxwell derived it theoretically and then con¬ 
firmed it by experiment. The coefficient of viscosity is predicted by equation 
9.21 to be proportional tov and therefore to the square root of the absolute 
temperature but is found from experiment to be more nearly proportional to 
r 0,7 , as was the thermal conductivity. 

The combination of the equations for the thermal conductivity and coeffi¬ 
cient of viscosity of a gas eliminates the mean free path and leads to one of the 
most striking predictions of the kinetic theory 


k = oct )— = arj — ( 9 . 22 ) 

m M 

The coefficient a is equal to unity in the simple theory given in this section and 
to 2.52 in the rigorous theory of a hard-sphere gas. The value of a found from 
experiment using equation 9.22 is therefore a direct check on kinetic theory 
calculations. The experimental values of a for monatomic gases are in quite good 
agreement with the value calculated by the rigorous theory: the value of a for 
helium for example is 2.44 and for argon is 2.47. Diatomic gases give lower 
values, for example 1.92 for nitrogen. The discrepancy is attributed to the 
different contributions to the coefficient of viscosity and the thermal conducti¬ 
vity made by molecular rotations. 

The good agreement between theory and experiment for the monatomic gases 
is not of course evidence that molecules may be treated as elastic spheres but 
rather that the effective size of a molecule in a thermal conductivity experiment 
is similar to that in a viscosity experiment. The calculated value of ot for a gas of 
molecules interacting via a repulsive potential of the form \ \r n for example is 
2.50 which is almost identical to the result for a hard-sphere gas. 

9.4.2 Gas flow along a tube (Poiseuille flow) 

The flow of gas along a pumping tube in a system evacuated from room pressure 
is at first a flow process in region II and is controlled by the coefficient of vis- 
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t: 


a 



Figure 9.6 (a) The equilibrium of an imaginary cylinder in the gas depends upon the forces 
exerted on the end faces by the pressure gradient being balanced by the frictional force 
due to viscosity exerted on the curved surface. Notice that the origin of the coordinates 
lies on the line through the centre of the cross - section of the tube, (b) The parabolic 
variation of the mass velocity across the tube. 


cosity. At lower pressure the mean free path in the gas becomes greater than the 
diameter of the pumping tube (region IV) and the concepts of the next section 
are required. 

The gas flow in region II will be assumed to be laminar flow in the steady 
state, or as it is sometimes called, Poiseuille flow. A fast vacuum pump may in 
fact produce non-laminar flow in the pumping tube but this aspect will not be 
considered here. The variation in the drift velocity of a gas across a tube of 
circular cross-section and radius a may be found by considering the equilibrium 
of an imaginary cylinder in the gas (figure 9.6) under conditions of laminar 
flow. The viscous force on the curved surface of the cylinder is balanced by the 
difference between the forces acting on the two plane faces so 


Therefore 


2 d P ^ d U x 

— nr — = — 2nrr) - 

dx dr 



-f — f r dr 

2 r? dx J 


U r 


r 2 d P 

— — + constant 
4t? dx 
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In region II it is assumed that the layer of gas next to the surface of the tube is 
at rest. Therefore U x is zero when r is equal to a and 


U x 


r 2 - a 2 dP 
47 ? dx 


(9.23) 


The variation of the drift velocity across the tube is therefore parabolic. 

The total mass of gas flowing in unit time through a given cross-section in 
which the pressure is Pis therefore 

J a P 

- 2nr U x dr 

r=0 R m T 


using P = pR m T where R m is the gas constant for unit mass of gas. The integra¬ 
tion over r is straightforward after substituting for U x and leads to a mass flow 
through a given cross-section of 


m 0 


-m 4 dP 
8 r\R m T dx 


The mass flow through each section of the tube must be a constant so after a 
further integration along the tube of length / 


na 4 Pi 2 - P 2 2 Tta 4 ~~ AP 

m 0 =-= - P — 

1 6r)R m T l 8 vR m T / 


(9.24) 


where P = (Pi + P 2 )/2 is the average pressure along the tube. 

This equation shows the importance of connecting the system to be evacuated 
to a pump by tubing of short length and large diameter since a pump can at best 
produce P 2 equal to zero. 

When the pressure gradient is small and the mean pressure is such that the gas 
in the tube may be considered to be in region II, a flow experiment and equation 
9.24 may be used to measure the coefficient of viscosity of a gas. As the mean 
pressure is lowered however the gas enters region III (a X > d) and the coeffi¬ 
cient of viscosity calculated using equation 9.24 now becomes a function of the 
diameter of the tube. The equation is inadequate in region III because it is no 
longer permissible to assume that the gas in the layer next to the wall of the 
tube is at rest. 

Knudsen suggested an improvement to equation 9.24 based on the assumption 
that the mass velocity of the gas was such that it would extrapolate to zero at a 
distance equal to (a + X) from the centre of the tube where X is the mean free 
path in the gas. In region II the mean free path is sufficiently small relative to 
the tube diameter for this correction to be negligible but this is not the case in 
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region III. The mass flow per second may now be shown to be 


m Q 


7 m 


%r\R m T 



A P 
l 


(9.25) 


where r x is a constant at a given temperature equal to XP. The second term in 
the bracket obviously becomes of increasing importance for a given tube as the 
mean pressure in the tube decreases and at a given mean pressure it is more 
important for small tubes than large ones. It may be shown that the form of r x 
implies that the surface of the tube appears completely rough to a molecule so 
that it is reflected in a direction which is independent of the direction from 
which it arrived on the surface. This point is discussed further in the next 
section. 

As the mean pressure in the tube decreases into region IV (X > a > d) 
equation 9.25 also ceases to apply because the whole concept of layers of gas 
with different mass velocities is now invalid. The molecules now make more 
collisions with the walls of the tube than with each other. The mass flow for a 
given tube and pressure gradient is found to decrease with the mean pressure in 
agreement with equation 9.25 but then passes through a minimum as shown in 
figure 9.7. At still lower pressure a new process leads to an increase in the mass 
flow. This free molecular flow in region IV will be discussed in the next section. 



Figure 9.7 The mass flow of a gas along a tube as a function of the mean pressure shows a 
minimum. 


9.4.3 Transport properties at low density (region IV) 

In region IV the gas molecules make more collisions with the walls of the con¬ 
tainer than intermolecular collisions. The simplest approximate treatment of this 
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region therefore consists of improving intermodular collisions completely and 
concentrating attention on the interaction of gas molecules with a surface. 

A molecule which strikes a surface may return to the bulk of the gas immedi¬ 
ately (after making an elastic collision) in a direction such that the angle of 
incidence of the molecule is equal to the angle of reflection. In optics this 
process is called specular reflection. The condition for specular reflection is that 


X > d sin 0 


(9.26) 


where X is the wavelength of the radiation, 0 the glancing angle with the surface 
and d the height of a typical surface irregularity. A gas molecule may be des¬ 
cribed by a de Broglie wave and it may easily be shown (exercise 9.5) that even 
for a good polished surface (d « 10 -7 m) specular reflection is only possible for 
molecules with very small glancing angles. The best surfaces are of course the 
faces of crystals for which d « 10 -10 m and specular reflection may then be 
observed up to glancing angles of about 30° for light atoms. 

The majority of the molecules leaving the surface of a tube would therefore 
be expected to return to the gas in a direction independent of the angle of 
incidence (diffuse reflection) because of the roughness of the surface, but 
diffuse reflection will also result from any surface if a molecule remains in con¬ 
tact with the surface for some period of time before it returns to the gas. The 
experiments of Knudsen suggest that the equation deduced for the flux of 
molecules on to the wall of a container (equation 8.23) is also correct for the 
flux of molecules returning to the bulk of the gas from the surface. 

A simple calculation of the mass flow of molecules at very low density along 
a long tube is possible if intermolecular collisions are neglected and the drift 
velocity along the tube is much less than the mean thermal speed of the gas 
molecules. The velocity gradient across the tube which was of vital importance 
in regions II and III may now be neglected. 

The number of molecules crossing unit area in unit time will be assumed to 
be m7/4 as for a gas in thermal equilibrium. The total momentum in the direction 
of gas flow given up to the walls of a tube of length / and radius a in unit time if 
the surface of the tube is rough is 


{2ml) j mU 0 


(9.27) 


where U 0 is the drift velocity down the tube. The force given by equation 9.27 
must be balanced by the pressure difference across the ends of the tube so 


2nal — mU 0 = na 2 (P x 
4 


The drift velocity is therefore 


2 a(P x - P 2 ) 
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and the total mass flow in unit time is found from the number of molecules in a 
cylinder of radius a and length U 0 l. 

mo = ira 2 nmU 0 


where m is the mass of a molecule. Therefore 

2na 3 A P 

m 0 = - — 

v l 


(9.28) 


The mass flow along a tube when the pressure is sufficiently low for the mean 
free path to be greater than the diameter of the tube is independent of the den¬ 
sity of the gas and of the coefficient of viscosity. 

This calculation for the mass flow along a long tube may be compared with 
the result for the effusion of a gas through a thin slit of radius a. The mass flow 
along the long tube is reduced by a factor of (1/tt) relative to the effusion process, 
once again showing the importance of short pumping lines of large diameter. 

The free molecular mass flow given by equation 9.28 is much greater than 
that expected from Poiseuille flow at low pressure (exercise 9.2). The pumping 
rate given by equation 9.28 is a function of the mass of a molecule of the gas 
(through!; ) so when a system is being evacuated the concentration of the 
residual gases will change with time (exercise 9.3). 

The problem of free molecular heat conduction may be treated in a similar 
way to free molecular mass flow but is complicated by the fact that in general 
the molecules returning to the bulk of the gas do not have a Maxwellian distribu¬ 
tion of velocities corresponding to the temperature of the heated surface. 

In a system in thermal equilibrium at temperature T the energy brought to 
unit area of surface in unit time by the molecules of a monatomic gas with 
speed between u and v + du is given by equation 8.23 multiplied by mv 2 /2. The 
total energy delivered to the surface is therefore 

nmv 3 
8 

The value of v 3 was found in exercise 8.1 to be 4kTv/m so the total energy 
delivered to unit area in unit time may be written 

Q = \nvkT = \Pv (9.29) 

which for a system in thermal equilibrium must also be the energy transfer from 
the wall to the gas. 

When a surface in a gas at temperature T is heated to temperature 7\ the net 
rate of transfer of heat per unit area is 

Q = \nvk(Ti - T) = Pv 


(9.30) 
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provided that the molecules actually leave the surface as if with temperature 7\. 
In general the molecules appear to leave with an effective temperature between 
T and 7\ and a correction must be made to the equation. 

The Pirani gauge is a simple and useful pressure gauge based on equation 
9.30. A thin tungsten wire of diameter d and length / is heated electrically to 
temperature T x . The temperature of the wire is found from its resistance (V/l) 
and the power dissipated in the wire is simply IV. Then 


Vndl{T x - T) , SkT 1 

IV = --' / - p 

_ 2 T v Tim 


(9.31) 


and the pressure may be calculated provided the gas in the system is known. The 
Pirani gauge is usually calibrated against a McLeod gauge rather than by using 
equation 9.31 because the calculated pressure depends upon the type of mole¬ 
cules in the system and it is not known if the molecules leave the wire with 
velocities given by T x . The Pirani gauge is useful in the pressure range 10 -2 to 
10 -4 torr but at lower pressure the heat loss by radiation becomes more impor¬ 
tant than the free molecular heat conduction (exercise 9.4). 


9.5 THE BOLTZMANN TRANSPORT EQUATION 


The rigorous theory of the transport coefficients of a perfect gas was begun by 
Boltzmann and developed by Chapman and Enskog. A gas in region II is suffi¬ 
ciently dilute for only binary collisions of the molecules to be of importance 
and the transport theory may then be developed in terms of the single-particle 
distribution function discussed in earlier chapters. The theory of the dense gas 
however requires the distribution function for pairs of molecules and will not 
be considered here. 

The generalised single-particle distribution function/(u, r , f) is defined 
f(y, r, Qd 3 vd 3 r is the average number of molecules with velocity in the range v 
to v + dv and position in the range r to r + dr at time t. 

In an isolated system the generalised distribution function will tend to the 
equilibrium Maxwell distribution of chapter 8 for all t greater than the slowest 
relaxation time in the gas. 

The behaviour of the generalised distribution function in the absence of 
molecular collisions is governed by the equation (Liouville’s equation of classical 
mechanics) 


3/ dr 3/ dv 3/ 

— =0 
3 1 d t 3 r d t dv 


(9.32) 


provided that the force acting on a particle is independent of its velocity. The 
force is then given by 
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Equation 9.32 can be extended to include the effect of magnetic fields (which 
lead to velocity-dependent forces) but this will not be done here. 

The equation may then be rewritten 


bf 

dt 


+ v • 


K 

dr 


0 

m dr 


(9.33) 


in the absence of collisions. 

The effect of interparticle collisions is to tend to make the system relax 
towards local thermal equilibrium. A collision term may be added to the right- 
hand side of equation 9.33 to give the Boltzmann transport equation in the form 


a/ a/ f a / 

— + v • — + - • — 
dt dr m dv 



The collision term was shown by Boltzmann to involve an integral over the 
(unknown) generalised distribution function. The Boltzmann transport equation 
is therefore an integro-differential equation which cannot be solved exactly. 

The equation is normally simplified by assuming (as in section 9.4) that the 
force acting on a particle is sufficiently weak for the transport coefficients to be 
independent of the force. The most general treatment of the Boltzmann equation 
(the variational method) which leads to expressions for the transport coefficients 
in terms of integrals which depend upon the law of force between the molecules, 
will not be considered here but instead a simpler treatment will be given. 

The relaxation time approach replaces the collision term by the expression 

/ a A = _ LzJl (9.34) 

\ dt) con T 

where r is the relaxation time and/ 0 the equilibrium distribution. The effect of 
the collision term may be seen by ‘turning off the external force which has 
driven the system from equilibrium. Then 

V = Kf - /o) = f-fo 
dt dt T 


since f 0 is independent of time, and the solution of the differential equation is 

r 

M -fo= (/CO) -/„) ~ T 

where /(0) is the value of the distribution function at the time when the force is 
switched off. The general distribution function therefore approaches the value 
for thermal equilibrium with characteristic time r. The relaxation time approach 
can be rigorously justified for certain types of intermolecular interaction 
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(Wannier (1966) chapter 18) and is also often used in solid-state physics to give 
approximate results. A system will have a number of relaxation times (which 
may be functions of the energy of the particles) but the longest relaxation time 
is normally the one of interest. 

The application of the Boltzmann transport equation in the form 


V\ + .. V t F _ V _ f - h 

b1) br m bv r 


(9.35) 


with a single value for r leads to values for the coefficient of viscosity and 
thermal conductivity with numerical coefficients of 1/3 (as in equation 9.16) 
which have already been seen to be inadequate. These calculations will not 
therefore be considered here but, as an illustration of the use of the transport 
equation, an expression for the electrical conductivity of a gas containing a low 
concentration of positive ions will be derived. This calculation has been chosen 
since it shows particularly clearly that the transport properties are due to the 
deviation of the system from thermal equilibrium. 

The force term in equation 9.35 may be written eE x x for a uniform electric 
field in the x-direction acting on a particle with charge e and the first term in 
the equation is zero if E x x does not change with time. The equation may now 
be written in the one-dimensional form 


V . _ f-fo 

X bx m \bv) t 

since a net current can only flow in the x-direction. 

The generalised distribution function may therefore be written 


f = fo 


(: 


eE x 3/ 


+ Vy 


m bv x bx 


9A 

9x / 


(9.36) 


The distribution function before the application of the electric field was simply 
the Maxwell velocity distribution 


/o = n 



mv 2 

2kT 


(9.37) 


where n is the number of ions per unit volume of the gas. The system is still at 
uniform temperature after the application of the field and if E x is independent 
of position as well as time 
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The electrical conductivity (a) is defined as the ratio of the current density 
( J x ) to the electric field 


o 


Jx 

E~x 


(9.38) 


where J x (defined as the average electric charge crossing unit area in unit time) is 
given by 


Jx 



e v x f d 3 u 


(9.39) 


At small values of the electric field the electrical conductivity is independent of 
E x (Ohm’s law) and / must then be such that 

1/ - /o I < /o (9.40) 

As a first approximation therefore the substitution 

3/ ^ % 

dv x dv x 


may be made on the right-hand of equation 9.36. Equation 9.39 now becomes 





e v x f 0 d 3 t> - 



3/o 

T - 

dv x 


d 3 u 


(9.41) 


The first term on the right-hand side is equal to nev x and v x is equal to zero 
(equation 8.10). The current density is due to the second term, which represents 
the deviation of the system from equilibrium. The Maxwell distribution for /„ 
(equation 9.37) leads to 


a/o 

dv x 


mv x 

kT 


fo 


J x = 



J 


T V x 2 fo d 3 u 

oo 


The integral cannot be evaluated without some assumption about the dependence 
of t on the velocity (energy) of the particles but if it can be removed from the 
integral and represented by some average value 7 then 


Jx = 


ne 2 E x r —r 


kT 


ne 2 r 


m 
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(using equation 8.12) and 
J 

o - — 

E x 

The electrical conductivity is independent of the field, in agreement with 
Ohm’s law, provided that J is independent of the field. 

The equation for the electrical conductivity could in fact have been obtained 
by the much simpler mean free path approach adopted in section 9.4 (writing 
X^vf where v is the mean speed) but the derivation using the Boltzmann 
equation has clarified the condition (equation 9.40) for which Ohm’s law will 
apply. This inequality may be written (using equation 9.36) in the form 

kT > w x eE x 

kT > XeE x (9.43) 

So Ohm’s law is valid if the energy gained by a particle from the field between 
collisions is much less than its thermal energy. This condition was assumed 
throughout section 9.4 but can now be seen to arise naturally in the solution to 
the Boltzmann equation. 

The many assumptions and approximations that had to be introduced in the 
derivation of the electrical conductivity of a dilute gas demonstrates how much 
more difficult transport theory is than the study of equilibrium properties. The 
problem of transport theory in a dense gas or a liquid in which collisions of 
more than two particles cannot be ignored is obviously still more complicated 
and is an active branch of current research. 
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ne 2 r 


m 


(9.42) 


9.6 CONCLUSION 

In chapter 8 exact expressions were obtained for the thermal properties of a 
classical perfect gas in equilibrium. Only approximate values for the transport 
coefficients were obtained in this chapter but they are of more general interest 
than might be thought from the discussion to this point. The direct application 
of the kinetic theory to vacuum physics, surface physics and atomic beams is 
obviously of great importance, but in addition the perfect gas calculations have 
often been taken over into solid-state physics. 

The application of the kinetic theory results for the electrical and thermal 
conductivities of a gas to the free electrons in a metal is discussed in section 10.1. 
This is not a good example however because (as was seen in section 6.5) the free 
electrons in a metal are not in the classical limit and must be treated by 
Fermi - Dirac statistics. The kinetic theory equation for the thermal conductivity 
of a gas (equation 9.16) 


K = jncyvX 
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has however often been used to discuss the thermal conductivity of a solid due 
to lattice vibrations. A mean free path for the thermal conductivity is indeed 
often calculated from the measured thermal conductivity and specific heat but 
it should be clear from the discussion in section 9.4 that the factor of 1/3 
probably has little significance. The heat transport in an insulator may be 
thought of as carried by quasi-particles called phonons (in analogy with the 
particle-like (photon) aspect of the electromagnetic field) as is discussed in 
sections 10.2 and 10.3 and the mean free path of a phonon increases at low 
temperature due to the decrease in the amplitude of lattice vibrations. The 
lattice thermal conductivity is therefore found to increase below room tempera¬ 
ture. The mean free path cannot however exceed the diameter of the sample rod 
and therefore becomes constant at sufficiently low temperature. The kinetic 
theory result of equation 9.16 is then sufficient to predict that the thermal 
conductivity of a pure insulator will become proportional to the product of the 
specific heat and sample diameter at sufficiently low temperature. Furthermore 
the lattice specific heat is proportional to T 3 at low temperature (section 10.3) 
so the lattice thermal conductivity must show a maximum as a function of 
temperature (figure 9.8). 

The results of the kinetic theory derived in chapters 8 and 9 are therefore 
not only of use in calculating the properties of perfect gases but are also 
employed in other branches of physics. 



Figure 9.8 The thermal conductivity of a pure crystal of an insulating solid (A1 2 0 3 ) depends 
upon the diameter of the sample at low temperature: x 2.8 mm; • 1.02 mm. (After 
R. Berman, E. L. Foster and J. M. ZimaniVoc. R. Soc. 231 (1955) p. 130.) 
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9.1 The coefficient of viscosity of helium at N.T.P. is 1.86 x 10 -5 kg m -1 s -1 . 
Find (a) the thermal conductivity, (b) the mean free path, (c) the molecular 
diameter, (d) the pressure at which the mean free path is equal to 10 -2 m at 
300 K and at 30 K. 

9.2 Find the ratio of the mass flow along a tube expected for Poiseuille flow and 
free molecular flow when the mean free path is equal to the tube diameter. 

9.3 Find an expression for the partial pressures of the residual gases in a vacuum 
system as a function of time in the region of free molecular flow. How does the 
time constant depend upon the mass of a molecule? 

9.4 A Pirani gauge ceases to be useful when the heat loss from the wire due to 
radiation becomes comparable with the heat flow in the gas. Estimate this 
pressure if the wire radiates as a black body, the residual gas is oxygen, the 
system is at room temperature and the temperature difference between wire and 
gas is small. 

9.5 Show that specular reflection is only possible at room temperature for 
hydrogen atoms with glancing angles less than 5 minutes of arc if the surface 
irregularities are of order 10“ 7 m. 



PART IV 


APPLICATIONS OF 
THERMODYNAMICS AND 
STATISTICAL MECHANICS 



10. Further Applications 


The basic principles of both the macroscopic (thermodynamic) approach and the 
microscopic (statistical mechanical) approach to systems in thermal equilibrium 
have been established in parts I and II. In the present chapter a number of 
examples are given of the application of both approaches to systems which have 
features which are currently of interest in physics research. 

There is a basic difference between statistical mechanics and thermodynamics 
which should be appreciated before one begins to apply the equations to real 
systems. The functional results of thermodynamics, the difference between Cp 
and Cy for example, are valid for any system in thermal equilibrium but the 
values of C P and C v individually, or of any other thermodynamic quantity, are 
unknown except at absolute zero. In chapter 7 it was shown that statistical 
mechanics could provide a general expression for the/specific heat at constant 
volume 


Cy = kp 2 



V 


where Z is the partition function and ft = kT~ l . A knowledge of the partition 
function is in fact sufficient to derive all the thermal properties of a system with 
a conserved number of particles in thermal equilibrium. The difficulty which 
arises is that, with a few exceptions, it is not possible to evaluate an exact parti¬ 
tion function for a real system. A model system is therefore chosen which should 
show the essential features of the real system and a partition function is derived 
for the model. This procedure is not of course peculiar to statistical mechanics 
but is the normal scientific approach to a complicated problem. One should 
always remember however that to say that a particular real system behaves as a 
‘perfect gas’ or that the conduction electrons in a metal may be treated as a 
quantum gas (neglecting interactions) is to make additional and possibly 
incorrect assumptions which are not required by thermodynamics. 



202 

10.1 QUANTUM GASES 


Thermal Physics 


In section 6.5 it was shown that the number of particles of a perfect gas in an 
energy level with degeneracy g r was 


e a+0e>- + 1 

(equation 6.43) where the positive sign applies to fermions and the negative sign 
to bosons. The parameter 0 is as usual equal to 1 jkT and a is defined by the 
equation 


N = 2 

r 


Sr 

e a+0e r ± 1 


( 10 . 1 ) 


where TV is the total number of particles. This equation can always be turned 
into an integral over the density of states function for fermions but it will be 
seen later in this section that difficulties arise for bosons at low temperature. 
At high temperature however equation 10.1 may always be written 


N = 2irV g 



e 2 de 

e a+pe + i 


(equation 6.46) and the classical limit was seen in section 6.5 to occur when 
exp ol > 1. A gas of molecules at N.T.P. is safely within the classical limit (see 
exercise 6.7) but it is of some interest to find the first quantum correction term 
when exp ot is large but the term ± 1 in the denominator of equation 6.46 can¬ 
not be ignored completely. A gas under these conditions is said to be slightly 
degenerate. 

The mean energy of the gas may be written 


E = TVe 



A 

e 2 de 

e c*+0e ± i 


( 10 . 2 ) 


The integral cannot be evaluated in closed form, but for exp oc much greater or 
less than unity, it may be expressed in the form of an infinite series. Only the 
first correction term is in fact of interest for a slightly degenerate gas and is 
found to be (Huang, 1963) 

E = f NkT I j ± _ h 3 N_ _ 

\ 2 * g Q'nmkTji V 

where the positive sign refers to fermions and the negative sign to bosons. 

Since the mean energy of a classical perfect gas is 3NkT/2 (equation 8.18) the 
effect of the Pauli exclusion principle is to increase the mean energy of a gas of 
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fermions. The free particles therefore behave as if there were a repulsive inter¬ 
action between them. Similarly bosons have a lower mean energy than a gas of 
classical particles which is equivalent to an effective attraction between the 
particles. The classical perfect gas was defined to have a mean energy indepen¬ 
dent of its volume but it is clear from equation 10.4 that the mean energy of the 
quantum perfect gas does depend upon the volume, due to the effective inter¬ 
action between the particles introduced by quantum mechanics. 

The correction term to the classical mean energy shown in equation 10.3 is 
in fact quite negligible for even the lightest molecules. At N.T.P. the value of the 
second term in the bracket for molecular hydrogen (a boson gas), for example is 
« 10 -6 . The mean energy of the gas may be related to the pressure exerted by 
the gas using the relationship 

E = \ PV (10.4) 

which has already been proved for a classical gas (equation 7.67). The extension 
of equation 10.4 to quantum gases follows because the volume dependence of 
the energy of a given energy level is (appendix IV) 


h 2 n 2 
Sit 2 m FT 


(10.5) 


It may be shown from quantum mechanics that a particle in the state n will 
remain in that state if the volume is changed sufficiently slowly under adiabatic 
conditions. The change in the mean energy with volume is related to the pressure 
by the thermodynamic equation 


P 



( 10 . 6 ) 


and if the number of particles in each state is unchanged the right-hand side of 
equation 10.6 is equal to — 2Ej3 V from which equation 10.4 follows immedi¬ 
ately. 

The pressure exerted by a slightly degenerate gas may therefore be written 


PV = NkT 


1 ± 


A7z 3 


2T g (2TtmkT)2 V 


(10.7) 


where the positive sign applies to fermions and the negative sign to bosons. 

Although this equation has the form of the virial expansion introduced in 
section 2.2 it should be clear from the earlier discussion of the magnitude of the 
correction term that this quantum correction is not of practical importance for 
real gases. 
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( 10 . 8 ) 


(10.9) 


is (apart from a small numerical factor) the de Broglie wavelength of a typical 
particle at temperature T (equation 6.50). The correction term therefore de¬ 
pends upon the ratio of the de Broglie wavelength to the mean separation of the 
particles. While this quantum correction is always negligible (relative to the effect 
of particle - particle interactions in a real gas) a quantum correction to the virial 
coefficient calculated by classical physics is necessary when the de Broglie wave¬ 
length becomes of the order of magnitude of the dimensions of a molecule, as is 
possible for hydrogen and helium. 

It might appear from the discussion up to this point that the theory of the 
quantum gases was of little practical importance. While it is certainly true that 
the effects discussed so far are quite negligible for gases there are systems which 
may in some sense be treated as weakly interacting to which the quantum gas 
calculations apply at least approximately. (Bose - Einstein statistics also apply to 
radiation (photons) as will be discussed in the next section.) 

The conditions under which quantum effects will become pronounced (a 
degenerate gas) are clearly just the converse of the conditions for the classical 
limit. When the particles have small mass, the system is at low temperature and 
the number density of particles is large, the classical approximation ceases to 
be satisfactory. It does not however follow that the resulting system will be a 
perfect quantum gas since the combination of low temperature and high 
number density is likely to lead to strong interactions between the particles. 

The neglect of these interactions must therefore be justified for any system to 
which the quantum gas calculations are applied. It is perhaps worth repeating 
that the expression ‘low temperature’ applied to quantum systems does not 
necessarily mean a small number on the absolute scale of temperature but rather 
that the system is at a temperature much lower than some characteristic tempera¬ 
ture for the system (section 6.3). The electrons in a white dwarf star for example 
at a temperature of 10 7 K will be seen later in this section to be in the low- 
temperature limit. 

The first application of Fermi - Dirac statistics to real systems was to the 
conduction electrons in metals. Drude had earlier given a classical theory of 
conductivity in metals which, although in some respects successful, contained 
features that were quite inexplicable within classical physics. 

The outstanding feature of metals is their high electrical and thermal con¬ 
ductivity. Drude suggested that the valence electrons in a metal become detached 
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from their parent atoms and are free to move through the crystal. These elec¬ 
trons are able to respond to a potential difference or a thermal gradient across 
the sample and are therefore called conduction electrons. Drude treated the 
conduction electrons as a Maxwell - Boltzmann gas and used the kinetic theory 
results for the thermal conductivity (equation 9.16) and the electrical con¬ 
ductivity (equation 9.42) leading to the expression 


K 

oT 


k\ 2 


( 10 . 10 ) 


which is a constant for any metal. This remarkable result is in good agreement 
with experiment except at low temperature. 

The concept of the free-electron gas must be taken seriously but, as was 
noted earlier in the discussion of the transport theory of gases (chapter 9) the 
agreement found between theory and experiment for the ratio of two quantities 
is not necessarily a proof of the correctness of the details of the theory but may 
be due to fortuitous cancellations in the individual terms. The temperature 
dependence of the electrical conductivity of metals for example is in fact quite 
different from that predicted by equation 9.42. A further difficulty of the 
classical theory of free electrons in metals is that the free electrons do not 
appear to contribute to the thermal capacity at room temperature. According 
to the equipartition theorem, N free electrons (one per atom in silver for 
example) have a thermal capacity of 3Nk/2. The thermal capacity of a metal 
should therefore be greater than that of an insulator by this amount. At room 
temperature however there is little difference between the thermal capacities of 
metals and insulators. 

Sommerfeld realised that even if the conduction electrons could be treated 
as a gas the condition for the classical limit (equation 6.49) would not be 
satisfied due to the small mass of the electron and the high number density if 
each atom in the metal contributed one or more free electrons (exercise 6.7). 
The electron gas in a metal is in fact in the extreme quantum limit and has a 
large mean energy even at the absolute zero of temperature, due to the effect of 
the Pauli exclusion principle which allows only one electron to occupy each 
single-particle state. 

Equation 6.43 will be rewritten in the standard notation of solid state 
physics by substituting - jfyx for ot. Then at absolute zero the probability of 
occupancy of a single-particle state r of a gas of fermions 


p r =- i - (10.11) 

e 0(e,-M) + 1 

is unity for all states with energy less than /x and zero for all states with energy 
greater than /x (figure 10.1). The value of fi at absolute zero is usually written 
ep (the Fermi energy). The Fermi energy may be found from equation 6.46 in 
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Figure 10.1 The Fermi-Dirac distribution function at absolute zero and at temperature T 
(T < Tp). The energy range equivalent to kT is shown. 


the form 

„ (2rn \2 /• €f l , 

N = 2irVg J J o e* de (10.12) 

since no states are occupied above ej.-. Then 



where the factor of 2 has been substituted for g since electrons have a spin of 

1 / 2 . 

Similarly the mean energy of the gas at absolute zero is (from equation 

10 . 12 ) 


E = 2nVg 



C € F 1 

Jo e ’ 


de 


= $Ne P 


(10.14) 


Hie value of the Fermi energy of a metal found from equation 10.13 assuming 
that each atom contributes one free electron is typically about 5 eV. A Fermi 
temperature may be defined as usual by the equation 


£[r - kTp 
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and for 5 eV is approximately equal to 5 x 10 4 K. A normal laboratory tem¬ 
perature is therefore much less than the Fermi temperature and the mean energy 
of the electron gas at 0 K is equal to that of a classical gas at 2 x 10 4 K. 

At temperatures above absolute zero the value of jjl is no longer equal to the 
value e F but must be found from equation 6.46 (where a = — j3/z). When e F is 
5 eV however the value of jti is found to decrease by only about 0.04 per cent of 
e F by room temperature. The main effect of the increase in temperature is that 
some electrons are now excited above the Fermi level (figure 10.1). The bulk of 
the electrons do not change their energy and so do not contribute to the specific 
heat. 




Figure 10.2 (a) The form of the density of single-particle states as a function of energy for 
free electrons, (b) The number of electrons per unit energy range at absolute zero and 
at temperature T(T < Tp). The energy range equivalent to kT is shown. 
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The change in the populations of the energy states with temperature is 
shown in figure 10.2. The number of particles in a small range of energies de is 
given by 


dN = 2 ;rVg 



62 


de 


e 0(e-M) + 1 


(10.15) 


At absolute zero there are no electrons above e F but at temperature T the 
electrons with energy within « kT of the Fermi energy move into higher energy 
states. The bulk of the electrons cannot respond to the temperature change 
because there are no unoccupied states within the energy range kT if T< T F . 

The number of excited electrons is 


N*. 



and each excited electron has gained energy of order kT so 


A E 


NkT 2 
T f 


and the heat capacity due to the free electrons is 


2NkT 

Cel * - (T < 7» (10.16) 

T f 

This expression is almost correct. The rigorous calculation using equation 6.46 
to find ju as a function of temperature leads to 

ir 2 NkT 

Cqi = (r < Tp) ( 10 . 17 ) 

2T f 

for N free electrons. The classical theory would of course predict 

C e i = f Nk (classical) (10.18) 


which is some fifty times greater at room temperature. The electronic contribu¬ 
tion to the specific heat at room temperature is therefore negligible relative to 
the lattice specific heat which is of order 3Nk since at high temperature the 
lattice vibrations may be treated as independent three-dimensional simple 
harmonic oscillators (section 10.3). The quantum theory of the electron gas 
therefore explains the main discrepancy of the classical theory of the free 
electrons in metals. At low temperature the electron contribution to the specific 
heat may be observed because it decreases more slowly with temperature than 
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the lattice contribution (see figures 4.2 and 10.9). 

The quantum theory also reproduces the most striking success of the classical 
theory (equation 10.10) with a slightly different numerical coefficient. 



k\ 2 


(10.19) 


where L is called the Lorentz number and is equal to 2.5 x 10” 8 watt ohm K -2 . 
This equation, as has already been remarked, is successful near room temperature 
but as in the kinetic theory discussion of chapter 9 it is clear that a ratio of this 
type involves identical mean free paths for both processes. At low temperature 
this ceases to be true and L may decrease by up to an order of magnitude. 

One further application of the Fermi - Dirac statistics will be briefly discussed 
in this section. The Fermi energy of a metal has been seen to be typically 5 eV, 
equivalent to a Fermi temperature of 5 x 10 4 K. A normal laboratory experi¬ 
ment is therefore performed at a temperature T and indeed the melting 
point of silver for example is only « 10 3 K. Since the magnitude of the Fermi 
temperature depends upon the number density of electrons in the system 
(equation 10.13) a star may be in the low-temperature (T < 7p) limit even 
though the temperature is extremely high (« 10 7 K) if the density of matter 
contained in it is much greater than normal. The theory of the Fermi - Dirac gas 
has been applied to a class of stars known as white dwarf stars which are believed 
to consist almost entirely of helium. The temperature of the centre of the star 
(« 10 7 K) is sufficient to ionise the helium atoms and the density of the helium 
is such that the concentration of free electrons is about 10 36 m~ 3 , correspond¬ 
ing to a Fermi temperature of 10 11 K. The electron gas is therefore in the 
degenerate limit T < 7p. The astronomical problem is however more complicated 
than that of the metal, since the electrons are moving sufficiently rapidly to 
have to be treated by relativistic mechanics. The equilibrium of the star also 
requires that the pressure exerted by the Fermi gas (exercise 10.2) be balanced 
by the gravitational attraction of the helium nuclei. The calculation of the 
mass - radius relationship for a white dwarf star will not be performed here 
[see Huang (1963) or Kittel (1969)] but leads to the conclusion that the mass 
of a star cannot be greater than 1.4 times the mass of the sun if it is to become 
a white dwarf. A greater mass exerts such a large gravitational attraction that it 
cannot be balanced by the pressure exerted by the Fermi gas. This prediction 
appears to be in agreement with astronomical observation and represents a 
striking example of the application of quantum gas theory to a system which 
could hardly be further from our original picture of a gas as a widely separated 
set of particles. 

The justification for the application of quantum gas theory to dense systems 
of rapidly moving charged particles such as electrons is really part of solid-state 
physics and will not be considered in detail here, but the physical idea behind 
the theory is fairly easy to see. The electrons are free to move against a back¬ 
ground of ionised atoms so the long-range Coulomb interaction between two 
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bare electrons is screened by the positive background and the interaction is then 
found to depend exponentially on distance. The efficiency of the screening 
increases with the density of particles so the importance of interactions between 
the electrons (the ratio of potential to kinetic energy) actually decreases with 
increasing density. A dense gas of electrons (with an equal background charge of 
opposite sign) is a more perfect gas than a similar dilute gas. 

An electron - electron scattering process is also restricted by the exclusion 
principle. The final electron states must be unoccupied if the scattering process 
is to occur, so only electrons with an energy within about kT of the Fermi level 
are affected. The effect of electron - electron interactions can therefore be 
ignored, to a first approximation. It is also easy to show that an electron is not 
scattered at all by a regular lattice so the electron-lattice scattering (leading to 
a finite electrical conductivity) is a weak interaction due only to the small - 
amplitude vibrations of the ions about their equilibrium positions. The mean 
free path of an electron in a pure metal is therefore expected to be much greater 
than a lattice spacing and this is supported by experiment. The mean free path 
of an electron in copper at room temperature is found to be about one hundred 
lattice spacings and increases at lower temperatures. 

The Bose - Einstein distribution has also been applied to a system which is 
very different from a gas. Helium is the only substance not to solidify under 
atmospheric pressure at low temperature and it is believed that even at absolute 
zero it would continue to exist as a liquid under conditions of saturated vapour 
pressure. A pressure of at least 25 atmosphere is required to solidify liquid 
helium. Helium liquefies at 4.2 K at atmospheric pressure and behaves as a 
normal fluid of low viscosity until a temperature of 2.17 K. At lower tempera¬ 
ture the properties of liquid helium are unique. liquid helium is often called 
helium I in the normal region and helium II in the temperature region below 
2.17 K. Helium II appears to have an enormously high thermal conductivity and 
the value of its coefficient of viscosity depends upon the method of measure¬ 
ment. A Poiseuille flow experiment using tubes of small diameter suggests that 
the coefficient of viscosity is zero but the damping of the oscillations of a 
pendulum leads to a finite value for the viscosity. A partial explanation of these 
properties (due to Tisza) considers helium I to be a normal fluid but helium II 
to be a mixture of normal fluid and superfluid. The concentrations of the two 
fluids are a function of temperature. The superfluid has zero viscosity and 
therefore determines the flow in small tubes but the normal component which 
also exists at finite temperature damps the motion of a pendulum. Helium II has 
many other extraordinary properties which will not be considered here. 

Natural helium is composed almost entirely of the isotope 4 He which is a 
boson system. The rare isotope of helium 3 He is a fermion and liquid 3 He does 
not show a transition at 2.17 K. It is natural to conclude that the superfluid 
properties of helium II are connected with the statistics of 4 He. (Very recent 
research however has shown that 3 He is also a superfluid at temperatures below 
3 mK.) There is less justification for treating liquid helium as a gas of bosons 
than there was for applying the theory of the Fermi gas to the free electrons in 



Further Applications 211 

metals, since interactions between the atoms must be of importance, but an 
approximate theory of helium II was developed by London following Einstein’s 
calculation for the quantum gas. 

Einstein noted that it was possible in certain circumstances for a large frac¬ 
tion of the particles of a boson gas to condense in the lowest energy level of the 
system. London suggested that the fraction of the particles in the lowest energy 
level corresponded to the superfluid component of helium II, and the particles in 
the upper states to the normal components. The difference between 4 He and 
3 He at low temperature would therefore follow since only one atom of 3 He 
could occupy the lowest state, due to the exclusion principle for fermions. 

The possibility of the condensation of a significant fraction of the molecules 
of a boson gas into the ground state will now be examined. The number of 
bosons in energy level r of degeneracy g r may be written (once again substituting 

a = - flu) 


Sr 

Hr =- 

e 0(e r -M) _ 1 


( 10 . 20 ) 


Now if the state of lowest energy (e 0 ) is taken to have an energy equal to zero it 
is apparent that ju must always be less than zero if n 0 is to be positive. The value 
of fi depends of course (if other factors are kept constant) on the temperature 
and at sufficiently high temperature will be large and negative (section 6.5). At 
some temperature it is possible that, for a given system, ju will approach the 
ground state energy of zero (from below) (figure 10.3). The number of particles 
in the ground state would then become much larger than that in the first excited 
state and would form a significant fraction of the particles in the system. The 
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Figure 10.3 In a degenerate gas of bosons the value of m may approach that of the ground 
state (from below) so that ImI is much less than the energy of the first excited state 
(ei). In the classical limit j u goes to minus infinity. 
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ratio of the numbers of bosons in the ground state ( n 0 ) and the first excited 
state (n x ) is (from equation 10.20) 

n Q qP( € i -/*) _ l 
n x e-fin - l 

and when ju is small (and negative) 


n 0 _ 

n i M 

so « 0 may become much greater than n x . 

A comparison with the behaviour of the classical gas may help to show the 
unique nature of the boson gas. The populations of the ground and first excited 
states of the classical gas are given by 

n 0 = n x eP € i 

Now e x may be written in temperature units as kT x and has been shown to be of 
the order of 10” 14 K (exercise 6.3). At all temperatures accessible to experiment 
therefore 



so the number of particles in the ground state is effectively equal to the number 
in the first (and many higher) excited states. 

The ground state of a boson system must therefore be treated with some care. 
The normal conversion of the sum (equation 10.1) into an integral involves the 
use of the density of states function 

(2 m\? i 

D(e) = 2nVg I —J e* de 

which excludes the ground state (e = 0) altogether. This approximation is always 
valid for fermions since the ground state can contain only one particle (apart 
from spin) and is also of no importance in the classical limit or the case of the 
slightly degenerate boson gas, where there are many states with effectively the 
same number of particles as the ground state, but must be improved upon for a 
degenerate gas of bosons. The simplest (and in fact adequate) approach is to use 
the normal density of states function, but to specify the ground state separately. 
If the number of particles in the ground state is N 0 and the number in all the 
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N = N 0 + JV, 
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e -0M _ l 



i; 


62 de 

e (e-ju)j3 __ 1 


At sufficiently high temperature we have already seen that ju is large and negative. 
In the high temperature limit therefore Ni is effectively equal to N. As the tem¬ 
perature is lowered (keeping the volume constant) it is clear from inspection of 
the integral that ju becomes less negative to keep N x equal to N. At some tem¬ 
perature T c , N x can only be equal to Af if ju is equal to zero. The temperature 
T c may be found from the equation 


N = 2irVg 



62 de 


t kT c - 1 


The integral may be expressed in the form of a standard integral, equation 
A2.9, to give 


N 


T c 



( 10 . 21 ) 


The critical temperature therefore occurs when the average particle separation is 
of the order of the average de Broglie wavelength of the particles. 

Now in fact n cannot be equal to zero but must be very small and negative at 
all temperatures equal to or less than T c . The number of particles in the excited 
states at temperature T is therefore 


N x = N 



(T < T c ) 


and the fraction in the ground state 
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The fraction of the particles of a boson gas in the ground state is therefore negli¬ 
gible above T c and increases at lower temperature as shown in figure 10.4. Since 
the'ground state has zero energy and zero momentum the behaviour of a boson 
gas below T c is sometimes described as a condensation in momentum space by 
analogy with the liquid - vapour transition where the liquid condenses in real 
space. 

The critical temperature of a boson gas calculated from equation 10.21 using 
the density of liquid helium (145 x 10 6 kg m“ 3 ) is 3.0 K, which is in quite good 
agreement with the experimental value of 2.17 K. The basic idea of the London 
theory of helium II, relating the superfluid component to the particles in the 
ground state, therefore seems to be valid and has been retained in more recent 
work, but the results of the theory of the boson gas are often in complete dis¬ 
agreement with the experimental results for liquid helium II. 

The temperature T c for a gas of bosons (equation 10.21) increases with the 
particle density and therefore with the pressure exerted on the system, but the 
superfluid transition in helium II decreases with pressure. The behaviour of the 
specific heat of helium II close to the superfluid transition (figure 10.5) is also 
not in agreement with the specific heat of a gas of bosons which varies as T 3 ^ 2 
below T c . The transition temperature in helium II is often called the lambda 
point (X), from the shape of figure 10.5. The transition of the boson gas is also 
fundamentally different from that in helium II in that it involves a latent heat 
(a first-order phase transition, section 10.4) but there appears to be no latent 
heat associated with the lambda point. A better theory of helium II must there¬ 
fore consider the theory of a quantum fluid rather than a quantum gas. 



T 


Tc 


Figure 10.4 The fraction of the particles in the ground state of a boson gas as a function of 
temperature. 



Further Applications 


215 



Figure 10.5 The specific heat of liquid helium under conditions of saturated vapour 
pressure shows a peak at 2.18 K (the lambda point). There is no latent heat associated 
with the peak. 


Hie theory of a degenerate boson gas does not apply to any real system of 
particles although it has obviously been of great importance for the qualitative 
understanding of helium II, but the original form due to Bose does apply exactly 
to the theory of radiation which will be discussed in the next section. 


10.2 BLACK-BODY RADIATION 

The experimental results for the behaviour of black-body radiation had been 
established by Stefan before 1880 but a satisfactory microscopic theory was not 
available until Planck introduced the quantum hypothesis in 1900. During the 
period 1880 -1900 however it became possible to understand many of the pro¬ 
perties of black-body radiation by purely thermodynamic arguments since 
thermodynamics does not require a microscopic theory for the system of 
interest. The present section therefore provides a good example of the usefulness 
of thermodynamics in the study of a system which is outside the scope of 
statistical mechanics at a given time. Another good example (which will not be 
considered in this book) is the superconductivity of certain metals at low tem¬ 
perature. The first experimental observation of superconductivity was made by 
Kammerlingh Onnes in 1911 and thermodynamics had to be used to clarify the 
nature of the transition before a microscopic theory of the effect was developed 
by Bardeen, Cooper and Schrieffer in 1957. 

The properties of the electromagnetic radiation in thermodynamic equilibrium 
with matter at temperature T are most easily studied by making a small hole in 
one side of a closed container. A sufficiently small hole will not disturb the 
equilibrium within the container (as in the case of gas effusion, section 8.5) and 
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the emitted radiation may be studied as a function of frequency and temperature. 
Radiation is often described as being ‘at temperature T although strictly the 
temperature refers to the temperature of the matter in equilibrium with the 
radiation. The radiation emitted from a small hole in a closed container is 
described as black-body radiation since all radiation incident on the hole from 
outside the container is absorbed and the hole therefore acts as a perfect 
absorber or black body. Black-body radiation contains the maximum possible 
amount of energy at every frequency for a given temperature because (as was 
first shown by Kirchoff) a perfect absorber is also a perfect emitter of radiation. 

The experiments of Stefan showed that the total energy of black-body 
radiation emitted per unit area per unit time at temperature T was of the form 

R = oT* (10.22) 

where T is the temperature on the thermodynamic scale and a is a constant (the 
Stefan constant, equal to 5.67 x 10" 8 watts m~ 2 K -4 ). This result was then 
derived using thermodynamics by Boltzmann, and equation 10.22 is often called 
the Stefan - Boltzmann law of black-body radiation. The derivation of equation 
10.22 is quite straightforward from the energy equation (equation 3.14) 

( d£ 0 -rf*) 

\dv J T Va tJv 

The relationship between the pressure and the internal energy of black-body 
radiation may be written 

PV = \U 
or 

P = j u (10.23) 

where u is the energy density. This equation may be derived either from classical 
electrodynamics or by treating the radiation as a gas of photons. The energy 
density of black-body radiation can only be a function of temperature, so 
equation 3.14 becomes 

T du u 

u = 

3d T 3 

Therefore 

T du 

u =- 

4 dr 

Therefore 

= aT A 


u 


(10.24) 
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where a is a constant. This equation is still not in the form required for a direct 
comparison with experiment since equation 10.22 refers to the radiation 
emitted from a hole in the container, not to the energy density inside it. The 
simplest way to establish a relationship between the two equations is again to 
consider a gas of photons and to employ the argument used for the effusion of a 
gas through a small hole. Since all photons travel with the speed of light the 
number passing through unit area per unit time is simply nc/4 where n is the 
number of photons per unit volume. If the mean energy of a photon is ~e then 

u = ne 

_ nc- uc acT 4 

R = € = __ = - 

4 4 4 


Therefore 

otc 

o = — (10.25) 

The temperature variation of the total energy of black-body radiation has there¬ 
fore been derived by a thermodynamic argument, but the value of the constant 
cannot be found by this method. 

The equation of state of black-body radiation may now be written 

P = - = - T* (10.26) 

3 3 

showing that the pressure is independent of the volume. The heat capacity of 
black-body radiation at constant pressure is infinite since an input of heat does 
not lead to an increase in temperature but the heat capacity of unit volume of 
radiation at constant volume (exercise 10.3) is very small relative to that of 
matter. 

The equation for a reversible adiabatic (isentropic) change will be seen later 
in this section to be of great importance. The required equation follows from the 
standard thermodynamic equation (3.15) 
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In an isentropic process d S is zero and therefore 


VdT = - - dV 
3 

VT 3 = constant (10.27) 


is the equation for an isentropic change of black-body radiation. 

This simple equation has been of importance firstly in the early attempt by 
Wien to understand the distribution of the energy of black-body radiation as a 
function of temperature (which will now be described) and more recently in the 
controversy concerning the origin of the universe. The equation 10.27 was used 
by Gamov to show that if the universe had begun as an extremely dense and hot 
concentration of matter and radiation (the primaeval fireball) which rapidly 
expanded (the big bang theory) then all space should be filled with black-body 
radiation. The detection of this radiation (which will be discussed at the end of 
this section) has been one of the most interesting developments in astronomy in 
recent years. 

The distribution of the energy of black-body radiation at temperature T as a 
function of frequency may be described by a function u(y , T ) such that 


Therefore 


u(v , T)dv is the energy density of black-body radiation 
in the frequency range v to v + dv. 


u(T) = f°° u(v,T)dv 
J o 


(10.28) 


The experimental form of u(v , T ) is shown in figure 10.6. At a given tempera¬ 
ture the function has a maximum value at some frequency (v max ) and decreases 
to zero at both limits. The value of i> max is seen to be proportional to the abso¬ 
lute temperature. The area under the curve for temperature T represents the 
total energy density and is therefore proportional to T* (from equation 10.22). 

The Stefan - Boltzmann law was derived (apart from the value of the constant) 
without any assumptions about the detailed nature of radiation but thermo¬ 
dynamics is not sufficient to derive the equation of the isotherms shown in 
figure 10.6. Wien however did extend the thermodynamic argument to show that 
the isotherms must have the form 


u(y , T) dv 



(10.29) 


= T 4 f(x) dx 


( 10 . 30 ) 
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Figure 10.6 The energy density of black-body radiation at temperature T as a function of 
frequency. 


where x is equal to vjT. The Stefan - Boltzmann law therefore follows immedi¬ 
ately 


u(T) = j; u(y, T)dv = T 4 f(x) dx (10.31) 

since the integral over x is an (unknown) constant. Wien’s argument will not be 
given in full here but, briefly, depends upon the calculation of the work done in 
an isentropic expansion by radiation pressure on a movable mirror. The radiation 
reflected from the mirror is Doppler shifted so that energy incident in the fre¬ 
quency range v to v + dv is reflected into a new frequency range v ' to v ' + di>' 
The calculation of the relationship between v and v ' and the use of equation 
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10.27 for an isentropic process then leads to equation 10.29. 

The maximum in each isotherm shown in figure 10.6 will, according to 
equation 10.29, have the form 



Therefore 

^max ft / J'max \ (^ m ax | = q 

T * \T ) \T ) ~ 

The solution to this equation (assuming that it exists) must be of the form 


^max , , 

-= constant 

T 


(10.32) 


in agreement with experiment. The equation of the maxima could also have 
been obtained by considering the energy density as a function of wavelength 
rather than frequency (figure 10.7) when 

u(T) = J“ t/(X, T ) dX (10.33) 

The form of m(X, T ) may be derived from equation 10.29 (exercise 10.4) 

(,o,4) 

(where X is the wavelength and c the velocity of light) and 


X ma xT = constant (10.35) 

This equation is usually called the Wien displacment law since it relates the 
position of the peak intensity (as a function of wavelength) to the temperature. 

It should be noted that the position of the maximum intensity measured as a 
function of wavelength is not the same as when measured as a function of 
frequency, that is to say c =£ ^ max X max (see exercise 10.4). 

Wien’s law contains all the information about black-body radiation which can 
be obtained from thermodynamics. Further progress therefore requires the use 
of statistical mechanics. However the classical statistical result (due to Rayleigh 
and Jeans) although giving excellent agreement with experiment at low frequency 
did not show a peak in the radiation isotherms at high frequency, but diverged 
(an effect sometimes called the ultraviolet catastrophe). The failure of classical 
statistical mechanics finally led Planck to introduce the concept of energy as a 
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Figure 10.7 The energy density of black-body radiation at temperature Tas a function of 
wavelength. 

quantised (rather than continuous) function. 

The Planck distribution law for u(v , T ) may be derived in two rather different 
ways due to Planck and Einstein. In the original classical calculation due to 
Rayleigh and Jeans the number of allowed modes of the electromagnetic radia¬ 
tion in the cavity were enumerated (the density of states function, appendix IV) 
and each mode was assumed to have the same mean energy as a linear simple 
harmonic oscillator. Then 


U(y, T)dv = Vu(y,T)dv = D(v) dv e 


Therefore 


u(y, T)dv = e 


(10.36) 


using equation A4.23, multiplied by two for the two transverse polarisations of 
an electromagnetic wave and 

c _ ck 
X 2ir 


v 
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The mean energy of a linear simple harmonic oscillator at temperature T , 
according to classical physics, is given by the equipartition theorem to be kT 
(section 7.6) so 

u(v,T) = v 1 dv (classical) (10.37) 

c 3 

This expression contains no unknown constants and is in excellent agreement 
with experiment at low frequencies but diverges at high frequencies instead of 
going to zero. 

Planck introduced the concept of the quantised simple harmonic oscillator 
with energy levels e = nh (where n is an integer and h a constant) to overcome 
this insuperable problem within classical physics. The mean energy of such an 
oscillator was calculated in section 7.4 


_ hv 
e = -- 

hv 

e kT _ i 

where the zero-point energy term hp/2 has been omitted since it leads to an 
infinite energy density at high frequencies. The Planck distribution law is 
therefore, substituting for e 7 in equation 10.36 


u(y, T ) dv 


Snh v 3 dv 

C 3 hv 

e kT - 1 


(10.38) 


This expression is identical to the classical result in the limit hv < kT but 
decreases to zero at high frequency. The frequency at which the distribution 
shows a maximum is given by 


( hv max\ 

1 - e“ ~* t ~ « 2.82 (10.39) 

in agreement with the form of equation 10.32. Equation 10.38 may also be 
written in the form 




in agreement with the thermodynamic result due to Wien. The value of the 
Stefan constant may be found from 


«m - % r 

c 3 


i> 3 di> 
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87 t(kT)* x 3 djc 

0 ch 3 ) Jo e* - 1 

and the equation 10.25 relating the energy density to the power radiated per 
unit area. Hence 

27tA: 4 x 3 djc 
c 2 h 3 Jo e x — 1 


The definite integral is equal to 7r 4 /15 (appendix II) so 

2 t T s k 4 

o = - 

15c 2 /z 3 


(10.40) 


The value of h can be found from either equation 10.39 or equation 10.40 and 
is in excellent agreement with the value found for the Planck constant in other 
experiments. In fact the most accurate value for the Stefan constant is obtained 
not by direct experiment but by calculation using equation 10.40. 

The Planck radiation equation was of course the first calculation in physics 
which treated the energy of a system as quantised. The calculation given above 
may be felt to be rather uncomfortably close to the original classical argument 
since the only difference is in the treatment of the mean energy of a simple 
harmonic oscillator. The simple harmonic oscillator was chosen by Planck as the 
simplest example of a radiating body but the same result can be obtained by 
more elaborate quantum-mechanical calculations. 

An alternative approach which has less connection with the classical argument 
is due to Einstein. In the theory of the photoelectric effect it is necessary to 
consider light not as continuous classical electromagnetic waves but rather as 
quanta of radiation (photons). The electromagnetic field in a container may be 
replaced by a gas of photons. The photons are indistinguishable particles with 
no restriction on the occupancy of energy levels (bosons) but unlike a gas 
of molecules the number of photons in the container is not constant. The 
Bose - Einstein distribution function has now to be calculated without the 
condition that led to the introduction of the undetermined multiplier a in 
section 6.5. The number of photons in the level r of degeneracy is therefore 


n r = - (photons) 

The energy of a photon is related to the frequency by 


(10.41) 


e = hv 



224 Thermal Physics 

The energy density of states function for photons is determined by the relation¬ 
ship 


e = cp 

(where p is the momentum), and the requirement that each photon state occupies 
a volume h 3 in phase space (appendix IV). The density of states function is in 
fact identical with the density of states for the normal modes (equation 10.36) 
so 


u(p) dv 



(10.42) 


in agreement with equation 10.38. The theory of black-body radiation can 
therefore be considered from either a wave or a particle viewpoint. 

The importance of the Planck theory of radiation in the history of atomic 
physics is too well known to be worth repeating here but the more recent use of 
the theory of radiation in cosmology and radioastronomy may not be so well 
known. 

According to the ‘big bang’ theory the present universe began at some 
definite time as an enormously dense and hot cloud of matter and radiation 
which then expanded and cooled. Gamow pointed out in 1948 that if the big 
bang theory was correct the universe must be flooded with black-body radiation 
which would look the same in all directions in space (isotropic radiation). The 
detection of this radiation in 1964 was therefore a most important check on the 
theory of the evolution of the universe from an initial hot phase and seems to 
disprove the theories which require either a cold start to the universe or a 
‘steady-state universe’ which has always looked as it does at present. 

The basic concept of the Gamow theory is that the energy density of radia¬ 
tion ( u ) was greater than that of matter during the early development of the 
universe after the big bang. The proof is straightforward and depends only upon 
the cosmological postulate of uniformity which states that the universe may be 
considered to be divided into cells within all of which the conditions are identical. 
(The concentration of matter into galaxies in the universe at present must either 
be ignored or averaged over a sufficiently large volume.) The problem of the 
expanding universe therefore reduces to that of the properties of matter and 
radiation in a closed container undergoing a reversible adiabatic (isentropic) 
change of volume. 

The present average density of matter in the universe is estimated to be 
10~ 27 kg m -3 and of radiation 10 -30 kg m“ 3 . The interaction between matter 
and radiation at present is extremely weak but we shall see that it must have 
been sufficiently strong at earlier stages in the evolution of the universe for the 
radiation to have been in thermal equilibrium with matter. The radiation must 
therefore be black-body radiation. 

The effect of decreasing our representative volume from its present value in 
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an isentropic process is to increase the energy density, as may be seen from the 
equations 10.24 and 10.27 

u = aT 4 

VT 3 = constant 

Therefore 

M aF" 4 /3 ocr 4 (10.43) 

where R 3 has been written for V. The rest energy of the matter of density p m 
also increases but at a slower rate since 

p m c 2 <x V~ 1 «R~ 3 (10.44) 

The two densities were equal (using the estimates of the two densities at present) 
when the total density was 10 -18 kg m~ 3 and the radiation density was greater 
than the matter density at earlier periods in the history of the universe. The 
history of the universe in this region will now be considered before returning to 
the properties of the radiation observed in the universe at present. 

The behaviour of the universe during the first fraction of a second 
(< 10 -44 s) after the big bang can only be a matter for conjecture but at later 
times a plausible theory can be constructed based on nuclear physics and the 
general theory of relativity. After about 10 -44 s the temperature of the universe 
was greater than 10 12 K and the density was greater than 10 17 kg m~ 3 . The 
universe consisted of high energy particles, antiparticles and photons. The 
average energy of a photon at temperature T M is « kT M so photons can produce 
particle - antiparticle pairs if 

kT M » 2 M 0 c 2 (10.45) 

where M 0 is the rest mass of the particle. At temperature above 10 12 K particles 
and antiparticles would be in equilibrium with photons but as the universe 
expanded and cooled below T M , particles of rest mass M 0 would annihilate with 
their antiparticles and not be replaced by pair production. The presence of 
atomic particles in the present universe must be due to an initial small excess 
of particles over antiparticles after the big bang. It has been estimated that the 
present density of matter in the universe is consistent with an initial excess of 
particles over antiparticles of one part in 10 9 but it is not of course possible to 
account for this. 

The temperature of the expanding universe is estimated to have dropped 
below 10 12 K after 10 -4 s. The heavy antiparticles would then all have annihila¬ 
ted but the antielectron (positron) continued to exist for about 10 s by which 
time the temperature had reached 10 10 K. The positrons then annihilated with 
electrons to form photons and left a small residue of electrons. Simultaneously 
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nuclear reactions between protons and neutrons led to the formation of helium 
nuclei. The universe therefore consisted of photons and an ionised gas of pro¬ 
tons, helium nuclei and electrons until the temperature had decreased to about 
3 x 10 3 K some 10 6 years after the big bang. (The role of neutrinos will not be 
considered here.) 

The electrons then combined with the ions to form a gas after which matter 
and radiation cooled separately since the photons could not interact with the gas, 
The equation for an isentropic change of black-body radiation (10.27) may 
therefore be applied between a temperature of 3 x 10 3 K and the present. The 
radiation must be true black-body radiation since it was initially in equilibrium 
with matter. The density of radiation and matter is estimated to be roughly 
equal at 3 x 10 3 K so the present temperature of the radiation may be found 
from equation 10.27 


T 



» 3K 


The original estimates of the temperature of the black-body radiation by Gamow 
and his collaborators were in the range 25 - 5 K in surprisingly good agreement 
with the present best experimental value of 2.99(+ 0.07, - 0.14)K when the 
obvious uncertainties associated with the calculation are considered. Gamow was 
led to the prediction of the background radiation in 1948 while constructing a 
theory of the formation of the elements after the big bang. The theory was 
neglected however after it had been shown to be unable to explain the formation 
of elements beyond helium and the prediction of the background radiation was 
forgotten. Astronomers were not in fact equipped in 1948 to search for isotropic 
radiation which if it had a temperature of 3 K would have a maximum intensity 
in the microwave region near 10 11 Hz. 

The development of radioastronomy should have quickly led to the discovery 
of the background radiation and in fact it probably was observed but was attri¬ 
buted to electrical noise in the system. Dicke independently predicted the 
presence of isotropic black-body radiation with a peak in the microwave region 
in 1964 but Penzias and Wilson made the first known observation of the back¬ 
ground radiation before his apparatus had been completed. 

Penzias and Wilson had designed a sensitive microwave system working at a 
wavelength of 7 cm to receive signals reflected from satellites. The electrical 
noise in the system was found to be above that which had been calculated and it 
was finally realised that a signal from the background radiation had been detec¬ 
ted. The intensity of the background radiation has now been measured at a 
number of frequencies (figure 10.8) and is consistent with black-body radiation 
with a temperature of 3 K. The measurements above the peak at 10 u Hz which 
finally established the nature of the radiation were made in 1975 using balloon 
flights because the earth’s atmosphere is strongly absorbing at these frequencies. 
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v(Hz) 

Figure 10.8 The experimental values for the intensity of the isotropic black-body back¬ 
ground radiation as a function of frequency. The theoretical curve for a temperature 
of 2.99 K is also shown. 


The lower limit to the measurements is set by the presence of strong galactic 
synchrotron radiation at frequencies near 10 8 Hz. 

The intensity of the background radiation is therefore consistent with 
black-body radiation of temperature 3 K and since the radiation is also found to 
be isotropic to the presently available accuracy of 0.1 per cent there seems little 
doubt that it constitutes good evidence for the big bang theory . A further 
improvement in the accuracy of the measurement of the intensity as a function 
of direction in space should show a small anisotropy due to the velocity of 
rotation of the solar system 200 km s *) about the centre of the galaxy. 

The existence of black-body radiation throughout the universe is of immense 
importance to astrophysics because the radiation deifsity at 3 K is very large 
compared to all other energy terms except the rest energy density of matter. 
Peebles has estimated for example that if it were possible to ionise all the atoms 
in the universe at the expense of the background radiation the temperature of 
the radiation would decrease by only 10 —5 K. The presence of the background 
radiation must therefore always be considered in astrophysical calculations. 


10.3 HEAT CAPACITY OF SOLIDS 

The heat capacity at constant volume of a body was defined in section 2.6 
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The heat capacity at constant pressure is normally obtained by experiment and 
then the correction discussed in section 5.2 applied to reduce the results to con¬ 
stant volume. The heat capacity at low temperature is however of particular 
interest and in this region the difference between Cp and Cy can usually be 
neglected. 

The heat capacity of a solid may be thought of as arising from a number of 
independent terms. All solids consist of an array of atoms which vibrate about 
their mean positions. Since the amplitude of the vibrations increases with tem¬ 
perature, there will be a change in the energy of the crystal lattice and a contri¬ 
bution to the heat capacity usually known as the lattice heat capacity. The con¬ 
tribution of the free (conduction) electrons in metals to the heat capacity has 
already been discussed in section 10.1. In addition to these terms any other 
energy term of the solid which changes with temperature will contribute to the 
heat capacity. In this section only one such effect (the so-called Schottky 
effect) will be discussed in detail but a number of other examples are 
considered briefly. 

The atoms of a crystalline solid form a regular array in space. The atoms 
vibrate about their mean positions with an amplitude which increases with 
temperature but even at the melting point the amplitude is much less than the 
interatomic spacing. An oscillation of small amplitude may always be represen¬ 
ted, to a first approximation, by simple harmonic motion but the interaction 
between the atoms of a solid leads to a coupling between the N individual 
oscillators where N is the number of atoms in the crystal. The situation is similar 
to that discussed for the electronic states of a solid in section 6.2. The crystal 
may be thought of as the bringing together of N oscillators each originally with 
the same frequency. As the oscillators begin to interact, the single frequency is 
changed into a range of frequencies which depend upon the strength of the 
interaction. The potential energy of one atom now depends upon the coordin¬ 
ates of all the other atoms in the crystal. 

The N coupled oscillators can however always be transformed by the methods 
of mechanics (for simple harmonic motion) to a set of 3N uncoupled one-dimen¬ 
sional simple harmonic oscillators with certain allowed frequencies known as 
normal modes. The classical value for the lattice heat capacity therefore follows 
directly from the equipartition theorem. Since the mean energy of each simple 
harmonic oscillator is kT the lattice heat capacity is 3Nk or 3 R for one mole of 
solid. This result was in fact first obtained by Dulong and Petit from experiments 
at room temperature. There are however exceptions to the rule (notably dia¬ 
mond) and the classical theory fails completely at low temperatures. 

The quantum theory of the lattice heat capacity involves the behaviour of the 
quantised simple harmonic oscillator and is obviously similar to the discussion 
already given for black-body radiation. There are however two features which in 
general distinguish the two cases and which arise from the discrete nature of the 
crystal lattice. Firstly since there are only 3N allowed normal lattice modes there 
must be a maximum lattice frequency and secondly the properties of a crystal 
may be different in different directions (anisotropy). These distinctions will be 
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seen to be unimportant at low temperatures (because only oscillations of long 
wavelength are then possible) and in this limit a result equivalent to Stefan’s law 
will be obtained. 

The mean energy of a simple harmonic oscillator of frequency v r was found 
in section 7.4 to be (apart from the zero-point energy which is of no interest for 
the heat capacity) 

_ hv r 

e r = - 

Q0 hv r - 1 

so the lattice energy may be written 

E = V —^- (10.46) 

r=l e& hv r - 1 

where the v r are the normal modes of the lattice. 

It is worth noting that a ‘particle’ interpretation can be given to equation 
10.46. In analogy with the photon associated with the electromagnetic field the 
‘particles’ are called phonons. The expression ‘quasi-particle’ is sometimes 
preferred for the phonons, to distinguish them from the atoms of the lattice. 

In the phonon picture there are n r phonons with frequency v r and the mean 
energy is 

E = 2 n r hv r 


where 

1 

n r = - 

e (3hv r _ x 

The analogy between photons and phonons should now be clear. The phonons 
are bosons with no restriction on their number and therefore follow the same 
form of the Bose - Einstein statistics as photons. 

The normal modes of the lattice may as usual be considered to be sufficiently 
close together to be approximated by a density of states function. The sum of 
equation 10.46 may then be transformed into an integral 


E = 


j: 


hv 


e0 hv - i 


D(p) d*> 


(10.47) 


where 


J o D(y)dv = 3N 


(10.48) 
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since the number of allowed modes must be restricted to 3 N. The statistical 
mechanics required for the lattice mean energy and therefore the lattice heat 
capacity is all contained in equation 10.47 but the purely mechanical problem of 
evaluating the density of states function has still to be considered. 

At sufficiently high temperature equation 10.47 reduces to the classical 
result regardless of the form of the density of states function. The exponential 
term may be expanded to give (1 + php) and 


E 



dp = 3 NkT 


(10.49) 


At lower temperatures the form of the density of states function is more 
important but because it occurs inside an integral it will be seen that quite a 
crude model is sufficient to give reasonable agreement with measured lattice heat 
capacities. Conversely it is difficult to get any information about the density of 
states function from measurements of heat capacity. 

The form of the density of states function D(v ) can be found for a given 
solid either by making some assumption about the number of neighbouring atoms 
each atom interacts with and then solving the resulting equations on a computer 
or by various experimental methods, notably neutron scattering (see Kittel 
(1971)). The density of states as a function of wave vector (appendix IV) of a 
crystalline solid may be found from symmetry arguments (group theory) so the 
frequency of a phonon as a function of wave vector is required (the dispersion 
relation) before D(y) can be evaluated. The measurement of the change of energy 
and momentum of a neutron beam scattered by the solid enables this relation 
to be established directly. 

The first quantum calculation of the lattice heat capacity (due to Einstein) 
evaded the problem of the density of states function by treating the lattice as a 
set of N independent oscillators all with frequency p E . The mean energy is 
therefore 


E = 


hv £ 


e^ hv E 


— C D{v) 

- 1 Jo 


dp 


Therefore 


E = 


3Nhp E 


e Phv E _ i 

The lattice heat capacity is 


c = (—\ - f—\ dg - 3Nk (0hP e) 2 e fihVE 
V W )v ” \30 )v d T (e^E _ if 


(10.50) 
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where v E is to be chosen to give the best agreement with experiment for a 
particular solid. An Einstein temperature is defined by the equation 


hv^ = kd E 


and is found to be typically 200 K, corresponding to a frequency of about 
5 x 10 12 Hz which lies in the infrared region of the spectrum. The law of Dulong 
and Petit can now be understood since room temperature is greater than the 
Einstein temperature of most solids, and equation 10.50 becomes 

CV * 3 Nk (T > 0 E ) 

The apparent exceptions to the law of Dulong and Petit arise for solids with an 
unusually high 0 E * Einstein showed that a reasonable fit to the measurements 
for diamond for example (which has a heat capacity about one quarter of the 
classical value at room temperature) was obtained using equation 10.50 with 0 E 
equal to 1320 K. 

At low temperature the Einstein equation is not usually in good agreement 
with experiment. The theory shows an exponential decrease of the heat capacity 
with temperature. 


cy 



(T < e E) 


(10.51) 


but for most solids the lattice heat capacity is found to be proportional to T 3 . 
This discrepancy is not surprising because at low temperature the low-frequency 
( hv r ** kT ) normal modes of the solid are important and these have been 
replaced by one high-frequency mode in the Einstein model. There are solids 
however, in which one particular lattice mode is important, which are quite well 
described by the Einstein model (figure 10.9) or a combination of the Einstein 
model and that of Debye discussed later in this section. 

The theory of the low-temperature lattice heat capacity may be approached 
in a rather general manner because only low energy (long wavelength) modes 
can be excited. The structure of the lattice is therefore not important and the 
dispersion relation becomes simply 


c s = v\ = - (10.52) 

k 

where c s is the velocity of sound in the crystal and k the wave vector. In general 
c s will be different for transverse and longitudinal waves and for different 
directions in the crystal, but these refinements will not be considered here. 

The discrete lattice may be approximated to by an elastic continuum in the 
low temperature (long wavelength) region and the density of states function is 
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Figure 10.9 An example of a solid whose specific heat is well described by the Einstein 
model. The lattice vibrations of AIjqV (upper curve) contain one low-frequency mode 
which dominates the specific heat (Tg = 22 K). A normal metal is shown in the lower 
line (see also figure 4.2). (After A. D. Caplin, G. Griiner and J. B. Dunlop Phys. Rev. 
Lett. 30 (1973) p. 1138.) 


then given by the same form as for black-body radiation (appendix IV) 

3V 

D(k)dk = - k 2 dk (10.53) 

2 it 2 

where the factor of 3 allows for the two transverse and one longitudinal waves 
with wave vector k. This equation leads to an infinite number of allowed modes 
when integrated over all values of k 9 rather than the correct value of 37V, but 
at sufficiently low temperature this is of no importance for the calculation of 
the mean energy of the lattice because the integral (equation 10.47) is effectively 
cut off at high frequency by the exponential term. 

Using equation 10.52 the density of states as a function of frequency 
becomes 


and the energy 
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(10.54) 


(10.55) 
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which is of the same form as Stefan’s law (equation 10.40). The lattice heat 
capacity is therefore proportional to T 3 at sufficiently low temperature; a result 
first obtained by Debye. The T 3 law in fact follows simply from the form of the 
density of states function in three dimensions. The general result for a system of 
n dimensions is T n (problem 10.5). 

The T 3 law is in good agreement with experiment for most insulators at 
sufficiently low temperature (figure 10.10) although highly anisotropic materials 
such as graphite which form in layers are closer to T 2 behaviour. Thin films also 
follow the T 2 law as would be expected for nearly two-dimensional systems 
(figure 10.11). The T 3 law is of course consistent with the third law of thermo¬ 
dynamics, as was discussed in section 4.4. 



Figure 10.10 The thermal capacity of a three-dimensional insulator is proportional to the 
cube of the temperature at sufficiently low temperature. 



Figure 10.11 The thermal capacity of an insulator in two dimensions (a helium film) is 
proportional to the square of the temperature at sufficiently low temperature. The 
upper line is for a film of average thickness (1.13 monolayers) and the lower for 0.90 
monolayers. (After D. F. Brewer J. Low Temp. Phys. 3 (1970) p. 205.) 
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Debye introduced a most useful approximate theory of the lattice heat 
capacity which reduces correctly to the Dulong and Petit law in the high tempera¬ 
ture limit and to the T 3 behaviour at low temperature and is usually a reasonable 
fit to the measurements at intermediate temperature. 

The Debye approximation involves using equation 10.52 for the dispersion 
law and therefore equation 10.54 for the density of states function. These 
equations are however only correct in the long wavelength limit and the Debye 
density of states is quite different from the true density of states function of a 
real solid at higher frequencies as may be seen in figure 10.12. The approxima¬ 
tion remains useful because only the integral of the density of states function is 
required for the lattice heat capacity. 



Figure 10.12 The density of states function in the Debye approximation (dashed) and the 
actual density of states, for a typical solid. 


The density of states function for an elastic continuum given by equation 
10.54 leads to an infinite number of normal modes if used in equation 10.48. 
Debye introduced the discrete nature of the lattice by cutting off the density of 
states function at a characteristic frequency vjy such that 


Therefore 



and the Debye temperature is given by 


(10.56) 


kd d - hv d 


(10.57) 
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The Debye temperature for a given solid can be found directly from the above 
equation (using an appropriate average value for the velocity of sound in the 
crystal) or by finding the value which gives the best agreement with experiment. 
The values of the Debye temperature found by these two methods usually agree 
to within ten per cent, showing that the Debye theory has a certain validity. An 
exact agreement between the Debye-theory and experimental heat capacities 
however is only obtained if the Debye temperature is allowed to change with 
temperature. 

The energy of the lattice in the Debye approximation may be written (from 
equations 10.47 and 10.54) 


where 


E 


12nV f*0D hv 3 dv 
c 3 Jo tP hv - 1 


1271-F 

c 3 h 3 


(kTf 


r 


x 3 dx 
e* — 1 


*d = 


hv p 
kT 


The equation for the energy may be written in terms of the Debye temperature 


9 NkT 


an: 


*D X 3 dx 


(10.58) 


Now in the low-temperature limit (T < 9 0 ) goes to infinity and the integral 

becomes a definite integral with value 7r 4 /15. The energy is then 


- 3jt 4 / T \ 3 

E = - NkT (— CT < 6 D ) 

5 \0d / 

and the heat capacity 

1?7T 4 / T\ 3 

C v = - Nk [ —) (T < 9 D ) (10.59) 

5 \d D J 

The slope of a graph of the measured heat capacity of an insulator as a function 
of the cube of the temperature therefore gives a value for 0 D . It is found that a 
true T 3 law only holds at temperatures below about 0 d/ 5O, which for most 
materials means in the liquid-helium range of temperature. 

The Debye theory reduces to the law of Dulong and Petit in the extreme 
high-temperature (T> 0 D ) limit. The exponential term in the integral of equa- 
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tion 10.58 may be written as simply (1 + x) since x is now small and then 


E = 9NkT 3 j** D x 2 dJt 

= 3NkT (T > 0 D ) 

as required. As in the Einstein model however it is clear that solids with a large 
0 d will not reach the classical limit by room temperature. For example the 
Debye temperature for diamond is 2050 K (compared with the Einstein tem¬ 
perature of 1320 K) and the high-temperature limit is not attained even at the 
melting point (« 3800 K). 

The Debye theory does not provide exact agreement with the measured heat 
capacities of solids in the temperature region between T > 0 D /5O and the high- 
temperature limit. The integral in equation 10.58 has been tabulated and the 
difference between the Debye theory and experiment is usually expressed by 
calculating the value of 0^ at each temperature to give exact agreement with 
experiment. The Debye temperature would of course be a constant if the theory 
was correct but usually first decreases from the value extrapolated to absolute 
zero and then increases, becoming roughly constant at high temperature. The 
variation in 0 D is however usually less than plus or minus twenty per cent of the 
value at absolute zero so a ‘typical’ value provides reasonable agreement with 
experiment. 

The Debye approximation to the density of states has been used to study 
many other phenomena in solid state physics such as compressibility, thermal 
expansion and elastic constants. The values of 0 D for a given system found by 
these different methods are usually in reasonable but not exact agreement. 

The Debye approximation to the density of states function is so simple that 
it is commonly used in solid state calculations over the whole temperature range 
when great accuracy is not required. The prediction from equation 10.58 that 
the lattice heat capacities of all solids lie on a universal curve when plotted as a 
function of r/0 D is in remarkably good agreement with experiment (figure 
10.13) although it has been seen that there is no unique value to take for 0 D for 
a material. A better theory of the heat capacity cannot avoid the labour of 
actually evaluating the true density of states of the normal modes of the lattice 
which, as may be seen from figure 10.12, is quite different from the Debye 
approximation at high frequencies. It is also necessary to go beyond the 
approximation of simple harmonic oscillators to consider the anharmonic terms 
in the potential energy if exact agreement with experiment is to be obtained. 

The contribution of the free (conduction) electrons of metals to the heat 
capacity was briefly discussed in section 10.1. The calculated heat capacity may 
be written 


Cel - — Nk — = yT 

2 r F 



0.4 0.6 



Figure 10.13 Heat capacities of several substances (J mol -1 ) compared with Debye’s theory. 
For the sake of clarity, portions I and III are shown shifted. [After E. S. R. Go pal, 
Specific Heats at Low Temperature (Plenum Publishing Corp., New York, 1966).] 
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(equation 10.18). At room temperature this contribution to the heat capacity 
was seen to be negligible relative to the lattice term with value 3Nk. In the low 
temperature region however the lattice contribution decreases as T 3 and the 
coefficient y can be found by experiment from the intercept on the C/T axis 
when plotted as a function of T 2 . 

- = AT 2 + y 

T 

as shown for example in figure 4.2. The measured values of y provide a direct 
comparison with the predictions of the quantum free-electron theory of metals 
since there are no unknown quantities in equation 10.18 if each of the atoms in 
the crystal is assumed to contribute its valence electrons to the free-electron gas. 

The agreement between the free-electron theory and experiment is normally 
expressed by defining an effective mass for the electrons (m*) by the equation 

m* = Tex 

m 7th 

where 7 th is given by equation 10.18 and 7 ex is found from the measured inter¬ 
cept shown in figure 4.2. The quantity m* is found to be about 1.2 m for 
sodium and 13m for copper showing that the conduction electrons cannot be 
treated as completely free electrons. The effective mass found from heat capacity 
measurements is in fact not a very useful quantity since the effective mass of an 
electron is different in general in different directions in the crystal and m* is 
therefore some kind of average value. 

The electronic contribution to the heat capacity therefore provides only 
limited information about the effective mass of an electron in a crystal lattice 
(or equivalently about the density of electron states at the Fermi energy) just as 
the lattice heat capacity was rather insensitive to the details of the density of 
states of the normal modes of lattice vibration. Heat capacity measurements are 
however relatively straightforward and even the rather limited information 
contained in 0 D or 7 ex is often useful when studying new materials. 

The lattice contribution to the heat capacity is important for all solids and 
the electron contribution is important for all metals (and sometimes for semi¬ 
conductors) but there are many other possible contributions to the heat capa¬ 
cities of particular solids. One other process common to all solids is the heat 
capacity due to the formation of defects in the perfect crystal lattice as the 
temperature is increased from absolute zero. The number of simple defects ( n ) 
in a crystal containing N atoms at temperature T was shown in exercise 6.6 to be 

n = Ne~@ € (n < N) 

where e is the energy needed to form a defect. The defect energy is ne and the 
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heat capacity 


C = — e-P' (10.60) 

In most solids the energy of defect formation is sufficiently large for the defect 
heat capacity to be small right up to the melting point but in the solids formed 
by the so-called inert gases the heat capacity at the melting point is twenty per 
cent greater than the Dulong and Petit value for the lattice heat capacity and the 
excess heat capacity is well described by equation 10.60. 

The defect heat capacity is typical of many possible contributions to the heat 
capacity of solids which depend upon the increase in the disorder of the system 
as the temperature is increased from absolute zero. A binary alloy of two ele¬ 
ments AB for example will (if in true thermodynamic equilibrium) form an 
ordered arrangement ABABAB in the crystal at absolute zero, as required by the 
third law of thermodynamics. At temperatures far above some characteristic 
temperature T c the arrangement of the two types of atom becomes random. The 
entropy (k In W) of the alloy is zero in the ordered arrangement (W= 1) and 
equal to Nk In 2 (where N is the total number of atoms in the alloy) in the high 
temperature limit as in the case of the two-level system considered in section 7.4. 

The order - disorder process makes no contribution to the heat capacity at 
either high or low temperature since T(dS/dT)v is then equal to zero, but leads to 
a peak in the heat capacity at the temperature T c where the long-range order 
vanishes (figure 10.14). 

The order - disorder phenomenon also occurs when the lattice contains atoms 
with spin s. At sufficiently low temperature the spins must form an ordered 



Figure 10.14 The transition from an ordered binary alloy at low temperature to a disordered 
alloy leads to a peak in the heat capacity due to the increase in the entropy of the system. 
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arrangement in space in agreement with the third law of thermodynamics and at 
sufficiently high temperature are distributed at random over the allowed energy 
levels. The change in the magnetic entropy is therefore 

AS = Nk In (2s + 1) (10.61) 

Now the entropy change due to the magnetic process may be found by measur¬ 
ing the heat capacity over a wide temperature range, subtracting the estimated 
heat capacity due to other terms (notably the lattice thermal capacity) and using 
the thermodynamic relation 

A5 = SI Cd(lnD 

The magnetic contribution (as in the order - disorder transformation shown in 
figure 10.14) is in fact largely concentrated near the transition temperature 
(figure 10.15) so the subtraction of the other terms is often straightforward. The 
spin of the particles can therefore be determined from measurements of the 
thermal capacity and equation 10.61. 



Figure 10.15 A peak is observed in the heat capacity at the transition from ferromagnetism 
to paramagnetism (the Curie point). The measured heat capacity of nickel is shown and 
the estimated contributions of the lattice (Cj), electronic (C e ) and magnetic (Cm) con¬ 
tributions. [After E. S. R. Gopal, Specific Heats at Low Temperature (Plenum Publishing 
Corp., New York, 1966).] 


One further contribution to the heat capacity of solids will be considered 
although this does not in any way exhaust the possibilities. A particle with spin s 
may be sufficiently weakly coupled to its neighbours for interparticle interactions 
to be negligible even at low experimental temperatures. The order - disorder 
phenomenon will not now be observed since the particles are only weakly inter¬ 
acting. One example of this type, the two-level system (s was considered in 



Further Applications 


241 


sections 6.3 and 1.2. A magnetic field splits the single allowed energy level into 
two levels separated by a distance A. At low temperature all the particles are in 
the lower level and at high temperature N/2 are in each level. The total entropy 
change is therefore Nk In 2 and the heat capacity shows a peak, as may be seen 
from equation 7.45 and figure 10.16. 



Figure 10.16 The change in the populations of the states of a weakly interacting (2/ + 1)- 
level system leads to a broad maximum in the heat capacity. The heat capacity of 
holmium is shown. At normal helium - temperatures it is only possible to observe the 
term in T ~ 2 (equation 10.63) but in this example the combination of a large magnetic 
field at the holmium nucleus and very low - temperature techniques allowed the whole 
curve to be measured. (Communicated by O. V. Lounasmaa.) 


Nklt 2 A 2 e (3A 

(1 + e ^ A ) 2 


(10.62) 


where 


A = 2jJL 0 H* 

A peak of this type is called a Schottky anomaly. Notice that the peak is much 
broader than that of the order - disorder transition because the Schottky effect 
arises from each spin individually interacting with the magnetic field, but the 
order - disorder transition involves the interaction of all the particles in the 
system (as in the solid-liquid transition, section 10.4). 

The theory of the two-level Schottky effect (s = 1/2) can obviously be 
generalised to the case of spin s where the particles are distributed over (25 + 1) 
levels and comparison of theory and experiment provides a method for deter¬ 
mining s. Alternatively if the magnetic moment (ji ) is known for a nucleus or ion 
the magnetic field at the site may be found (as was noted in section 7.4). 
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Hie theory of the Schottky effect at high temperature (kT> A) leads to the 
result 


C = 



— = DT~ 2 

rp 2 


(10.63) 


(exercise 7.2). The magnetic moment of a nucleus is so small that even at 
temperatures below 4 K the Schottky effect is usually only observed in the high- 
temperature limit. The total heat capacity would then be written 


C = AT 3 + yT + DT~ 2 

and the Schottky effect is observed as an increase in the heat capacity as the 
temperature decreases (figure 10.6). 

Hie discussion of this section should have made clear both the importance 
and the limitations of the information which can be obtained from heat capacity 
measurements and in particular the importance of measurements in the liquid- 
helium temperature range below 4 K. In general heat capacity measurements are 
used to established approximate values for such quantities as the effective mass 
of the conduction electrons in a metal or the spin and spacing of the energy 
levels of particles in a given solid. These values are then used in the design of 
other experiments (such as resonance techniques) which are capable of an 
accuracy many orders of magnitude greater than that possible in heat capacity 
measurements. 


10.4 PHASE TRANSITIONS 

The isotherms of a pure substance were shown in figure 2.6 as a function of 
pressure and volume. The equivalent isotherms as a function T)f pressure and 
density are shown in figure 10.17. At high temperature and low density the 
isotherms are identical to those of a perfect gas but below the critical temperature 
T c it is possible to liquefy the gas by applying sufficiently high pressure. The flat 



Figure 10.17 The isotherms for gas and liquid as a function of pressure and density. The 
critical isochore is shown as a dotted (vertical) line. 
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region of the isotherms (along which liquid and vapour are in equilibrium) re¬ 
duces to a point of inflexion on the critical isotherm. The critical isochore is 
shown by a dotted line in figure 10.17. 

The pressure - temperature diagram is also of interest (figure 10.18). Vapour 
and liquid are in equilibrium along the line ab but above T c it is not possible to 
convert vapour to liquid by an increase of pressure and therefore the line stops 
at b. Along oa vapour and solid are in equilibrium. The solid - liquid equilibrium 
along ac does not appear to have a critical temperature so, unlike ab, ac extends 
without limit. The intersection of the three curves represents the triple point on 
the P- T diagram. 



Figure 10.18 The pressure - temperature diagram for phase equilibria. 


10.4.1 First-order phase transition: Clapeyron’s equation 

The solid - liquid, solid - vapour and liquid - vapour (except at the critical point) 
phase transitions are accompanied by changes of density or, equivalently, volume 
per unit mass (u) and by a latent heat (/). The latent heat is related to the change 
of entropy of the system by equation 3.26 

l 

s 2 s i - 

T 

where T is the temperature of the transition and s is the entropy per unit mass 
of a phase. 

The first-order derivatives of the Gibb’s free energy for unit mass therefore 
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changes discontinuously since 


d A\ 

dT/p 


— s and 



v 


(10.64) 


and the phase transition is called first order. The classification of phase transi¬ 
tions by the lowest derivative of the Gibbs function which changes discontin¬ 
uously is due to Ehrenfest. The liquid - vapour transition at the critical tempera¬ 
ture and pressure involves no change in volume and is therefore an example of a 
higher-order phase transition (section 10.4.4). 

The equation of the lines shown in figure 10.18 was first derived by Clapeyron. 
A phase change occurs at constant pressure and temperature and under these 
conditions it has already been shown (chapter 4) that the Gibbs function for the 
system must be a minimum. When n x moles of phase 1 exist in equilibrium with 
n 2 moles of phase 2 the total Gibbs function is 

G = n x g x + n 2 g 2 

where as usual g refers to unit mass. A change of n x by d n requires a charge of 
- d n in n 2 to conserve the total mass of the system and therefore 

dG = {g x - g 2 )dn 

d G must be zero at equilibrium, since G is required to be a minimum, and there¬ 
fore^ = g 2 . 

The Gibbs functions per unit mass of each phase in equilibrium are equal. 

A small change of temperature and pressure along the coexistence curve will 
lead to equal changes in g x and g 2 



which using equation 10.64 becomes 


/9P\ = s 2 -s x 

\dT / a v 2 — V\ 

or since the latent heat l = (s 2 - )T 


(10.65) 


-) - ' 

\dTJa T(V 2 ~ V X ) 


( 10 . 66 ) 


where the subscript o means that the differential is to be evaluated along the 
coexistence curve. It should be remembered that s, l and v must all refer to the 
same mass of substance, v is therefore the reciprocal of the density. 
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A normal solid is more ordered, has lower entropy, than the corresponding 
liquid and / is therefore positive, that is heat is needed to change a solid at its 
melting point to a liquid. However an exception to this rule is discussed in 
section 10.4.3. The latent heat is never negative for the liquid - vapour transition. 

The sign of the slope of the liquid-solid coexistence curve, ac in figure 10.18, 
is determined by the density of the liquid and solid phases at the melting point. 
The solid is usually denser than the liquid and therefore dP/dT is positive, as 
shown in figure 10.18. A well-known exception to this behaviour is water, for 
which dP/dT is negative, since ice is less dense than water, and ice therefore 
tends to melt under increased pressure. 


10.4.2 The liquid - vapour transition. Integration of Clapeyron’s equation 

An approximate relationship between the pressure and boiling point of a 
liquid may be derived by integrating equation 10.66. The density of a vapour, 
away from the critical temperature, is much less than that of its liquid (v y > u/). 
Assuming that the equation of state of the vapour may be approximated by the 
perfect gas law, equation 10.66 becomes 


dP l IP 
dT ~ To y ~ RT 


Then if / is independent of temperature 


In 


KKG- t) 


(10.67) 


The vapour pressure therefore varies exponentially with temperature. Equation 
10.67 is useful for rough calculations but is improved by using a virial expansion 
for the function of state and also by allowing for the temperature dependence 
of the latent heat. The simple form 


l = l Q - aT 


where l 0 and a are constants, is often used. 


10.4.3 Pomeranchuk cooling 

In 1950 Pomeranchuk proposed a new method for the attainment of very low 
temperature which has since been used (1973) to cool 3 He (section 10.1) to 
0.002 K, 2 mK, above absolute zero. The principle of the method may be 
understood using only Clapeyron’s equation (10.65) and the P- T diagram for 
3 He at temperatures below 300 mK (figure 10.19). 

The remarkable feature of figure 10.19 is that the P- T curve has a minimum. 
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Figure 10.19 The phase diagram of 3 He at low temperature, close to the melting curve. 
(After W. J. Huiskamp and O. V. Lounasmaa, Rep. Prog. Phys. 36 (1973) p. 423.) 


dP/dT is negative at temperatures below this minimum, but (v\ - v s ) remains 
positive. The entropy of the solid is therefore greater than that of the liquid 
(equation 10.65) and an isentropic increase of pressure on the liquid leads to a 
decrease in the temperature of the system. 

The P-T and S-T diagrams are shown for temperatures below 30 mK in 
figures 10.20 and 10.21. An isentropic increase of pressure from X to Y at a 
starting temperature of 25 mK would lead to a final temperature of 2 mK. At 
2 mK all the liquid has been converted to solid and the cooling process is com¬ 
plete. It is clear from the S- T diagram that the starting temperature must be 
far below 318 mK, where the solid and liquid entropies are equal, if appreciable 
cooling is to be obtained. Temperatures in the 20 - 30 mK region can now be 
reached fairly easily however using 3 He - 4 He refrigerators. 

The main features of Pomeranchuk cooling have been discussed so far using 
only thermodynamic arguments but statistical mechanics is required to explain 
the shape of the S-T curves. 3 He is a Fermi particle (section 10.1) and the 
liquid is an example of a Fermi liquid with T F = 450 mK. In section 10.1 it was 
shown that the entropy of a Fermi gas is proportional to the temperature for 
T<T f since 


S = J y dr = 7 Jd T = yT 


(10.68) 
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T (mK) 


Figure 10.20 The entropy of solid (S s ) and liquid (S\) 3 He along the melting curve. The full 
nuclear spin entropy is marked in the figure. (After W. J. Huiskamp and O. V. Lounasmaa, 
Rep. Prog. Phys. 36 (1973) p. 423.) 



T( mK) 

Figure 10.21 The phase and entropy diagrams of 3 He demonstrating Pomeranchuk cooling. 
(After W. J. Huiskamp and O. V. Lounasmaa, Rep. Prog. Phys. 36 (1973) p. 423.) 

and this result also appears to hold for a Fermi liquid (as shown in figure 10.21). 
The entropy of the solid however is largely due to the disorder of the nuclear 
spins of 3 He. Since I =\for 3 He the ‘high-temperature limit’ of the entropy is 
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R In 2 per mole (section 6.3) and the entropy decreases exponentially to zero 
at low temperature. The liquid entropy reaches the value R In 2 at 318 mK, the 
minimum in the coexistence curve, when dP/dT and (s s - s{) are equal to zero. 

Pomeranchuk cooling is the obvious method to use when studying the 
properties of 3 He at low temperature since the liquid cools itself, but it is more 
difficult to cool other materials because solid 3 He forms in the warmest part of 
the cell and thermally isolates the sample. The other main experimental diffi¬ 
culty of the method is the necessity for pressure changes to be made reversibly if 
frictional heating is not to outweigh the Pomeranchuk cooling. Two good 
references to this important topic will be found in the reading list. 


10.4.4 Critical point exponents for higher-order phase transitions 

The density of liquid (pj) and gas (p g ) along the coexistence curves shown in 
figure 10.17 are different below the critical point. The parameter ( p { — p g ) 
which vanishes at the critical point is called the order parameter for the 
liquid - gas transition. In recent years it has come to be realised that many other 
higher-order transitions (for example ferromagnetic, superconducting, super¬ 
fluid) can be characterised by an order parameter and that all these transitions 
have features in common, sufficiently close to the critical point. The analogy 
between the liquid - gas transition at the critical point and magnetic transitions 
is particularly close. The apparent simplicity of the behaviour of systems close 
to the critical point which will be discussed in this section, has led to much 
theoretical work in recent years. The statistical mechanics of the critical region 
is too difficult to discuss in this book but one derivation will be given of a most 
important thermodynamic result. 

The modern study of critical phenomena may be said to have begun in 1945 
when Guggenheim showed that measurements on simple gases below the critical 
temperature were well described by a law of corresponding states of the form 


Pi ~ Pg 
2p c 




(t^ T c J 


where A and 0 are constants and p c is the critical density. The constant ft is 
called a critical point exponent. The expression T c _ means that the temperature 
approaches the critical point from below. The law of corresponding states was 
seen in section 2.2 to be predicted by any two-parameter equation of state such 
as the van der Waals equation but the experimental value-of ]3 is found to be 1 /3 
rather than 1/2 as predicted for a van der Waals gas. 

The approximate equations of state discussed in section 2.2 are therefore 
inadequate in the critical region and many attempts have been made to construct 
more realistic equations of state. 
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The isothermal compressibility has already been seen to diverge at the critical 
point since 


JL ( 3F\ _ _1 /3 p\ 
V \dpJ T p [dpJr 


(10.69) 


and (dP/dV) T is zero at the critical point. The van der Waals equation predicts 
that above the critical temperature but on the critical isochore (figure 10.17) 


K 


T 



( r -+r c+ ; P = Pc) 


(10.70) 


with y equal to unity. At temperatures below T c the isothermal compressibility 
has to be defined for each phase separately and 


Kf 



(t^t c _>p “ pl ° r pg) 


along the coexistence curve. The van der Waals equation leads to y = y' = 1 but 
the experimental results suggest that y ' is about 1.2 for most gases and is not 
necessarily equal to y. The standard notation of primed exponents ( y r ) for 
temperatures below T c should be noted; |3 however is an exception to the rule. 

The experimental specific heat of a system at constant volume along the 
critical isochore (figure 10.22) may be written 

CV = - -4* In I 1 —— 1 + 5* (10.71) 

T c 



ln|7--r c | 


Figure 10.22 The heat capacity at constant volume of argon along the critical isochore 
appears to go to infinity at the critical temperature. 
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where A and B are constants. The specific heat at constant volume therefore also 
appears to go (rather slowly) to infinity. 

A more general form for equation 10.71, which below T c involves the critical 
point exponent a', follows from the definition 


a = lim 




/ in cy \ 

\ln I T - T c I j 


iP = Pc) 


(10.72) 


The equation 10.72 reduces to equation 10.71 for the special case of a ' equal to 
zero. The experimental values for a' are close to zero (less than 0.2) and for the 
analogous magnetic transition are definitely negative (« - 0.2). The van der 
Waals prediction is a = a ' = 0 but the experimental values above and below the 
magnetic transition are not found to be equal. 

The other critical point exponents should also be understood as arising from 
limits defined in the manner of equation 10.72. For example the equation (10.70) 
for the isothermal compressibility is not to be understood as exact (hence the « 
sign) but as indicating that sufficiently close to T c the divergence of the term in 
7 is stronger than that of any other term. 

The decision about the size of the region which is ‘sufficiently close’ to T c is 
often difficult to make when analysing experimental results. At very small 
values of ( T c — T)/T c the choice of T c may have an appreciable effect on the 
value found for the critical point exponent and at larger values the measurements 
may no longer be within the critical region. 

Accurate measurements are difficult to perform in the critical region since the 
liquid - gas transition is found to be extremely sensitive to the effects of gravity 
(due to the large value of the isothermal compressibility) and the time for the 
system to come to thermodynamic equilibrium also becomes very long as the 
critical point is approached. The infinities associated with the critical point 
exponents, such as equation 10.70, are not of course observed by experiment, 
partly because no real system has a uniquely defined value for T c , due to 
impurities (although in favourable cases such as superconductivity the transition 
may take place within a range of 10 -3 K) but also for the more fundamental 
reason that equation 10.70 only applies in the thermodynamic limit. In a finite 
system the transition will always be ‘rounded’ close to T c due to the effect of 
fluctuations. 

The calculation of the critical point exponents using the methods of statistical 
mechanics is a difficult branch of current research and will not be considered 
here. A number of inequalities obeyed by the various critical point exponents 
can however be derived using only thermodynamics and as an example the 
inequality 


a + 20 + y > 2 (10.73) 

will be considered. These inequalities are obviously of great importance since 
they represent a test of the properties of any model of the phase transition used 
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by statistical mechanics and are also a useful check on the experimental values 
of the critical point exponents. It appears possible in fact that equation 10.73 is 
an equality but no rigorous proof has yet been given. The original proof of the 
inequality was given by Rushbrooke (1963) for the analogous magnetic transition 
and was extended to the liquid-gas transition by Fisher (1964). 

The proof of the inequality involves the construction of an exact equation 
for the specific heat at constant volume of a system along the critical isochore. 

All the terms of this equation are then recognised to be positive and by exclud¬ 
ing all but one of these terms the inequality is established. The proof is fairly 
long, but straightforward, and is given in some detail since it is a good example 
of the power of thermodynamics. 

The system under consideration has total mass M contained in a constant 
volume V. Since the system is to be heated along the critical isochore the 
relationship between M and V is determined by the equation 

Mv c = V 

where v c is the volume of unit mass of substance at the critical point (the recipro¬ 
cal of the critical density). At any temperature below T c the system will have a 
mass m\ in the liquid phase with volume per unit mass and mass m % in the gas 
phase with volume per unit mass u g where 

m\ + m g = M 

+ m g v g = V = Mv c (10.74) 

The mole fractions in the two phases may therefore be written 


x\ = 


m i 

= Ol. 

- v c 

(10.75) 

m\ + m g 

v s 

- V { 

m g 

_ 

- V\ 

(10.76) 

m { + m g 

v e 

- V\ 


The effect of an increase in temperature 5 T is to transfer a mass dm from the 
liquid to the gas phase. The specific heat of the liquid phase will be written C a l 
where the subscript o means along the coexistence curve. Then the total specific 
heat at constant volume of the whole system ( Cy ) will be given by the equation 

(m x + m % )C v 8T = (m x C a l + m g C a g )8T + 18m (10.77) 
where / is the latent heat. 

The quantity 8m may be found from the condition that the total volume 
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remains constant using equation 10.74 


/3u,\ 

m\v i + m g v g = (mi — 6m) [i>i + v\ I — J 

\dT J a 

Therefore 

mi (—\ + m, 


5 r] + (m g + 8m) [u g + 




8T] 


/duA / bvA 

( — I + ( —- ) 

\bTjo \ bTjc 


8m = 


8T 


v x - u g 


(10.78) 


On substituting for 8m using equation 10.78 for / using the Clapeyron equation 
10.66 and for X\ and x g , equation 10.77 becomes 

cv = x i c ° +x * c ° t - T (^) o [ Xl + J (1079) 


The quantities C a l and C a g now have to be eliminated from equation 10.79. In 
one phase the standard thermodynamic relation 


leads to 


Therefore 



where the Maxwell relation has been used to eliminate (dS/dV) T . 
The term (dP/bT) v may be transformed by the equation 


to give 


C 0 - C v 


(dP\ _ /3P\ /3P\ /dV\ 

[dr Jo “ (drJr \dvJr\drJo 

r( dP ) ( dV ) 2 + r( dF ) ( dV ) 
\dvJ T \dT) a \bT Ja \dT Jo 


(10.80) 
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This equation applies to each phase separately so on reintroducing the super¬ 
scripts 1 and g and substituting into equation 10.79 it is found that the second 
term in equation 10.80 cancels leaving 


Cy = X{Cy^ + XgCyg — T 


Lffl (^yfeyi 

L V3ui/r UTJa Vaug/rVar Jo J 


It is now convenient to introduce the isothermal compressibility of each phase 
by the usual definition (equation 10.69) and to substitute the density of each 
phase for the reciprocal of the volume per unit mass. Therefore 


cy=*i<V +x 6 cy g + X{T 

Pi 3 k-t 1 


( 2 + ( ap g \ 2 

W/* p g 3 kt % \a t )„ 


(10.81) 


This is the final (exact) expression for the specific heat of the whole system at 
constant volume along the critical isochore. The reason for introducing Cy and 
Cy e is that specific heats at constant volume are always positive since (equation 
7.18) 


c _ [(E - E)] 2 
V kT 

and all the terms on the right-hand side are positive. The third term on the 
right-hand side of equation 10.81 is also positive so the inequality 


Cr > X * T 2 
p g 3 K T g \dT/ 0 


(10.82) 


has been established. As the critical point is approached at the constant total 
density (p c ) the value of x\ approaches 1 / 2 , (u c = (v\ + u g )/ 2 ) and p g approaches 
p c . The isothermal compressibility and (3p g /3 T) a however, diverge near the 
critical point since 



T) 1 


and hence 


In C v > (2 - 2(3 - 7 ') I In (T c - T) I + ... (T^ Tq J 


Therefore 


In Cy 

I In (T c - T) I 


> 2 - 2/3 - 7 ' 
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The higher-order terms vanish in the limit and the left-hand side is then the 
definition of a' so 

a + 2/3 + y' > 2 (10.83) 

There have been a number of attempts to show that on the basis of further 
assumptions (the Scaling laws) the equality is in fact correct. The van der Waals 
equation satisfies the equality (a' = 0, p = 1/2, y* = 1) but the agreement with 
experiment for P is so poor that this result has little significance. The values of 
the critical exponents for a number of idealised theoretical models have however 
been calculated with great accuracy and do seem to support the equality. The 
equality is also compatible with experiment although this is not a rigorous test 
since y 1 cannot be measured to high accuracy. 

There are many other thermodynamic inequalities but only one further 
example will be given here. The Griffiths inequality (1965) may be written 

ol + 0(1 + 8) > 2 (10.84) 

where a ' and P have already been defined and 8 is defined by the variation of 
(P - P c ) with (p — p c ) along the critical isotherm T c . The early measurements on 
4 He in the critical region were found to be in disagreement with this inequality 
(a' = 0.017, P = 0.354,3.8 < 5 <4.1) but further work suggests that a' « 0.15 
leading to a value of 2.01 for the left-hand side of equation 10.84. The magni¬ 
tude of the change in the experimental value of a ' should be sufficient evidence 
of the difficulty of measurements near the critical point and of the Importance 
of the thermodynamic inequalities as a check on such measurements. A full 
account of the theory of the critical region may be found in the recent book by 
Stanley (1971). 


10.5 NEGATIVE TEMPERATURE 

The thermodynamic temperature ( T ) was introduced in section 3.1 as the 
integrating factor for the imperfect differential cTg R such that the perfect 
differential d S was defined by the equation 


d S 


cT<2r 

T 


The thermodynamic temperature may then be written 



and has been treated as an inherently positive quantity up to this point. 

The temperature also appears as a positive quantity in statistical mechanics 
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since the partition function is written 

Z = 2 e~® E r 

r 

where j3 = IfkT. There is no upper limit to the allowed energy states of a system 
such as the simple harmonic oscillator and therefore fi must remain positive if 
the partition function is to be finite. 

The possibility of negative temperatures can only exist for systems with an 
upper allowed energy level of finite energy for which the partition function 
would remain finite if j3 became less than zero. In the strictest sense it is easy to 
see that no such system can exist and that in true thermodynamic equilibrium 
the thermodynamic temperature must be positive but it will be shown that it is 
sometimes possible to produce a state in one aspect of a system which may be 
defined by a negative temperature. 

The theory of the thermal capacity of solids discussed in section 10.3 
assumed that the energy of a crystal may be written as the sum of a number of 
independent terms, as for example 

E = Ei + E e + E s + ... (10.85) 

where E x is the energy due to lattice vibrations, E e is the energy of free electrons 
and E s is the Schottky energy due to the splitting of energy levels. The lattice 
energy is proportional to the temperature in the high-temperature (T> 0 D ) 
limit and in this sense there is no upper limit to the allowed energy of the 
crystal. (Above the melting point of course the crystal ceases to exist but the 
energy of a liquid and a gas also have no upper limit.) In thermodynamic equili¬ 
brium all the terms on the right-hand side of equation 10.85 must be described 
by the same temperature and since the lattice energy is restricted to positive 
temperatures only positive temperatures are possible, in agreement with the 
assumptions made earlier in the book. 

A more general definition of temperature however does admit the possibility 
of negative temperatures. The two-level system for spin 1/2 in a magnetic field 
[or in general the (21 + 1) - level system for spin /] satisfies the condition that 
there exists a finite upper energy level. The nuclear spins in certain insulating 
crystals (for example LiF) are found to establish internal equilibrium in times of 
the order of 50 /is (the spin - spin relaxation time) but to require some 300 s to 
come into equilibrium with the rest of the crystal (the spin - lattice relaxation 
time). It is therefore legitimate to treat the nuclear spins as isolated from the 
lattice and in internal equilibrium with a spin temperature (T s ) for times long 
compared to 50 /is and short compared to 300 s. 

A spin temperature T s may be defined if the probability of occupation of a 
spin state of energy E r is given by 

Q—Ps^r 

P r — 

2 Q-PsEr 
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where as usual |3 S = 1 jkT s . This condition is always satisfied for the two-level 
system and the equation for the number of spins in the upper state (separated 
by an energy A from the ground state) 


N i 


N 

1 + e^ sA 


( 10 . 86 ) 


may be thought of as defining fi s . The number of spins in the upper state tends 
to N/2 as T s goes to plus infinity (j3-* 0+ ). If N x can be made greater than N/2 
the system must be described according to equation 10.86 by a negative value of 

ft* 

The experimental method for attaining negative spin temperatures is to place 
the sample in a large steady magnetic field B 0 . Then 

A = 2jji 0 H* 

where fd is the nuclear moment (section 7.4). The spin temperature is then equal 
to the lattice temperature (T L ) and N x (<N/2) is given by equation 10.86 with 
p s equal to fi L . The spins cannot follow a sufficiently rapid reversal of the field 
ju 0 ff* to - iXoH* which then leaves N x >N/2 and the spin temperature equal 
to — 7 l. 

The population of the upper state given in equation 10.86 may be written in 
the symmetrical form 


N N (l — e^ A ) 
2 2 (1 + e^ A ) 


(10.87) 


since a change from fi s to — j3 s changes the sign of the second term (figure 10.23). 
Notice that the equal population of the two levels corresponds to equal to 
zero, that is to say to T s equal to both plus and minus infinity. Negative tempera¬ 
tures correspond to higher energies (E s >NA/2) than positive temperatures and 
it is logical to describe them as ‘hotter’ than any positive temperature. The third 
law of thermodynamics is unaffected by the possibility of negative temperature 
because the spin system returns to the lattice temperature via an infinite tem¬ 
perature (N = N/ 2) not via absolute zero (figure 10.23). 

The spin system can only be brought into the region of negative temperature 
by a rapid reversal of the magnetic field. A quasi-static change of the field would 
leave the spins with the temperature of the lattice since the time involved would 
be greater than the spin - lattice relaxation time. A Carnot engine cannot there¬ 
fore be constructed to work between reservoirs with temperature of opposite 
sign. 

A spin system at positive temperature will tend to absorb an alternating 
magnetic field at frequency (v 0 ) such that 


hv o = A = 2jU 0 i/* 


( 10 . 88 ) 
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Aft 


Figure 10.23 The temperature of a two - level system may be defined in terms of the 

population of the upper level if the spin - spin relaxation time is much shorter than the 
spin - lattice relaxation time. A system with negative temperature (A) passes to positive 
temperature (B) via (3 S equal to zero ( T s = ± «>) not via T = 0 K. 


since the probability of a spin being excited by the field to the upper level by 
the absorption of a photon is equal to the probability that a spin in the upper 
state will be induced to emit a photon (stimulated emission) and drop to the 
lower level. The power absorbed is therefore proportional to the difference in 
the populations of the levels (for a given system and alternating field) 

Pec (N 0 - N x ) 

A system with a negative temperature (N x >N 0 ) has a negative power absorp¬ 
tion, that is to say amplifies the signal at frequency v 0 , but this response will 
only continue until the populations of the two levels become equal. 

An amplifier which can operate continuously may be constructed if some 
method is found of repopulating the higher energy level. A maser (Microwave 
Amplification by Stimulated Emission of Radiation) depends upon this principle 
One type of three-level maser is illustrated in figure 10.24. The energy separation 
(A) of the levels 1 and 2 is arranged to satisfy equation 10.88 for the signal 
frequency p 0 by adjusting the magnetic field ix 0 H *. The upper state 3 is now 
populated by an alternating magnetic field at the pump frequency (i> p ) given by 

hv p = A 13 


(10.89) 
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Figure 10.24 In one version of a three-level maser the energy state 2 is populated by 

exciting the state 3 with a pumping oscillator. A signal with frequency given by equation 
10.88 is then amplified. 


where A 13 is the energy separation between the states. The population of state 
3 can be raised to half that of the equilibrium populations of states 1 and 3 by a 
sufficiently strong pumping field. The population of state 2 depends upon the 
rate at which it is populated from state 3 and the rate at which transitions are 
made from 2 back to the ground state. A sufficiently fast decay from state 3 to 
state 2 will leave N 2 greater than N x and the signal at p 0 will be amplified. The 
states 1 and 2 are sometimes described by a negative temperature but it is clear 
that this is no more than an expression of the relative populations of the two 
states and has little relation to the spin temperature discussed earlier which 
corresponded to internal equilibrium between the spins. 

The maser is used in the microwave region (3 - 30 GHz) as a low-noise 
communications amplifier. The required energy separation of states 1 and 2 at 
these frequencies (10 -5 -10 -4 eV) is obtained by applying a magnetic field to a 
system of electron spins rather than the nuclear spins discussed earlier in this 
section. A low concentration of Cr 3+ ions in alumina (ruby) has been found to 
be a suitable spin system. The ruby maser is usually operated at 4 K since the 
noise in the system is mainly thermal noise which is proportional to the absolute 
temperature. The other contribution to the noise is due to spontaneous emission 
from state 2 to the ground state but at microwave frequencies this term is small 
relative to thermal noise. However the laser (light amplification by stimulated 
emission of radiation), which operates on the same principle as the maser but at 
much higher frequencies, is a noisy amplifier since the noise due to spontaneous 
emission is now much greater than thermal noise. 
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10.6 MAGNETISM 

The study of the magnetic and dielectric properties of materials is one of the 
most important applications of the methods of thermodynamics and statistical 
mechanics. Magnetism is inherently a quantum phenomenon however and a full 
description of the magnetic properties of solids requires a greater knowledge of 
quantum mechanics and solid state physics than has been assumed in the rest of 
this book. Only the simplest statistical model will therefore be treated in this 
section, although the model will be seen in section 10.6.5 to be sufficient to 
explain the principle of two important applications in low-temperature physics. 
The thermodynamics of magnetism is also a difficult topic and only an outline of 
the problem is given in section 10.6.3. 

10.6.1 Simple paramagnetism of localised spins 

The simplest example of a paramagnetic substance is an array of N s localised 
particles each with spin \ in a volume V. The interaction between the spins and 
between a given spin and the crystal lattice must be sufficient to allow the 
system to come to thermal equilibrium but has no other effect on the individual 
spins. In the next section it will be seen how this description has to be modified 
when dipolar forces between the spins are important. 

The spin--| system has already been briefly discussed in section 7.4. The 
single-particle energy level is two-fold degenerate in zero external magnetic 
field and splits into two non-degenerate levels with energy iiiiqH* and — jUj u 0 H* 
when the sample is placed in a uniform field with value where ju is the 
dipole moment of the particle. It should be immediately apparent that this 
description is inadequate for a real solid as 7U 0 K since the entropy of the 
system in zero external field would then be N s k In 2 but if the spins formed an 
ordered arrangement in space the entropy would go to zero. When the spins in a 
region of the sample are all aligned parallel to each other, for example, the system 
is said to be ferromagnetic. 

The interactions between the spins cannot therefore be ignored except at 
sufficiently high temperature where the interaction energy between two dipoles 
a distance a apart is much less than the thermal energy 

2 

^ kT (10.90) 

4im 3 

This condition is obviously best satisfied for particles of small dipole moment 
which are widely separated in space. 

The dipole moment of a nucleus is of order 

eh 

47rm p 


M N 


= 5.0 S x 10~ 27 JT- 1 
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where m p is the mass of the proton. (The values for particular isotopes vary 
between 5ju N and zero but these distinctions will not be considered here.) The 
two sides of equation 10.90 are equal at a temperature of 7 x 10 -9 K for a 
system with a moment of jii N on each site, that is a spacing of about 0.3 nm. 

A system in which the magnetism is entirely due to the nuclei (a nuclear para- 
magnet) therefore satisfies the inequality under all normal circumstances. 

The dipole moment of an atom or ion with spin \ is of order 


Mb = 


eh 

471772 * 


where m e is the mass of the electron and the inequality is now satisfied for 
temperatures greater than 0.02 K if a spin is placed on each lattice site. This 
result is however seriously misleading because when the wave functions of 
adjacent atoms overlap an interaction of quantum mechanical origin called the 
exchange interaction becomes far more important than the dipolar interaction. 

In iron for example the exchange interaction is sufficiently strong for the system 
to be ferromagnetic at temperatures below 10 3 K. 

A low-temperature electronic paramagnet must therefore be formed by 
diluting the N s magnetic ions with N non-magnetic ions. A famous example of 
such a crystal which will be discussed in section 10.6.4 is cerium magnesium 
nitrate (CMN) which has the formula 2 Ce(N0 3 ) 3 .3 Mg(N0 3 ) 2 . 24 H 2 0. Only 
the cerium ions have a magnetic moment so the dilution is sufficient to prevent 
the formation of an ordered system down to temperatures below 10 mK. 

The magnetic properties of the simple paramagnet can now quickly be 
deduced but to stress the specialised nature of the results, the subscript ni for 
non-interacting spins will be used throughout this section. 

The total magnetic moment of the sample in a uniform external field, IXqH* 
is simply equal to the product of the moment of one particle and the difference 
in the number of spins in each level 


jffl ni “ -^sM 
A =2piu 0 H* 


Vl * e-«V 


(10.91) 


Hie magnetisation is defined as the magnetic moment per unit volume of sample 

/I - e-<*\ 

Mni = ( 1 + ) (10.92) 


and the isothermal susceptibility is defined by the equation 
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Figure 10.25 The magnetisation of a simple paramagnet as a function of x = juMo H*/kT. 


The form of equation 10.92 is shown in figure 10.25. In the high-temperature 
limit 


M ni 


n s (x 2 n 0 H* 

kT 



Xni 


«sM 2 Mo = C 

kT T 


(10.93) 

(10.94) 


a result known as Curie’s Law. The value of Cis « 03(n s /n)K for an electronic 
paramagnet and 9 x \0~ 8 (n s /n)K for a nuclear paramagnet so the magnetic 
susceptibility is small at normal temperature. A system which obeys equation 
10.93 is said to be an ideal paramagnet 

The susceptibility is independent of the external field in the high-temperature 
region where Curie’s Law applies but this is not generally true. The next term in 
the high-temperature expansion of equation 10.91 for example leads to 

n s n 2 ii 0 H* _ n s i/(x 0 3 H* 3 
kT 3k 3 T 3 

n*Ii 2 lXQ _ « s M 4 Mo 3 #* 2 
kT k 3 T 3 

The susceptibility is now a function of both temperature and field but from 
considerations of symmetry it is clear that the susceptibility will always be an 


M nl = 

Xni = 
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even function of the external field and the magnetisation an odd function. 

A comparison of the partition function for the non-interacting spin system of 
N s spins each with spin \ 

-jSA j3A 

Zs “ (e 2 + e 2 yys 

with equation 10.91 leads to the result 



(10.95) 


which may be used to extend the results for spin \ to particles of higher spin. 

The application of the equations developed in this section will be found in 
sections 10.6.4 and 10.6.5 but in the next section their range of usefulness for 
real magnetic systems will be discussed. 


10.6.2 Dipole - dipole interactions 

In the previous section each spin was assumed to interact directly with the 
external field PoH*. This approximation is satisfactory if the magnetisation is 
very small, as for a nuclear paramagnet, but in this section the corrections 
needed when dipole - dipole interactions are non-negligible will be considered. 

When an ellipsoid is placed in a uniform magnetic field ju 0 H* and a mag¬ 
netisation Afis produced, the macroscopic internal field is given by 

H { = H* - Td M (10.96) 


where y d is the demagnetising factor of the sample in the direction of the field. 
The demagnetising factors along the principle axes of the ellipsoid are subject to 
the sum rule 


7d x + 7d y + 7d 2 = 1 

so 7 d is equal to j for a sphere, by considerations of symmetry, to zero along a 
long rod and to \ transverse to a long rod of circular cross-section. 

The macroscopic internal field is the field used in, for example the Maxwell 
electromagnetic equations but is not correct for the calculation of the behaviour 
of localised spins. The dipoles occupy a special position in the unit cell and the 
local field at each lattice site must be found by adding to equation 10.96 a con¬ 
tribution due to all the other dipoles in the lattice. An elementary account of 
the local field will be found in Kittel (1971) and a fuller analysis in Robinson 
(1973). The simplest correction term is due to Lorentz and adds to equation 
10.96 a term y^M where y l is equal to-j. Therefore 


tf.ioc = h* + (tl - 7d )M 


( 10 . 97 ) 
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and since all three vectors lie along the same direction in the systems under 
consideration they may be written as scalars. 

The magnetisation of the system of spins interacting via dipolar forces in the 
external field fx 0 H* is equal to that of the non-interacting system in the local 
field Mo#i loc 


(H*> = M ni (//> c ) = M nl (H* + (7L - 7d)^int) 

The right-hand side may be expanded in a Taylor series, the second term of 
which is zero, since M must be an odd function of H *, and higher terms are 
negligible except for strongly magnetic substances so 


= M n { (//*) + ( 7l 


7 d )M mt (H*) 


[ 


9M ni (//*) 

m* 




T 


Therefore 


M mt (#*) = - Mni ^ - (10.98) 

l-(7L-7d)Xni 


There are unfortunately at least two possible definitions of the susceptibility of 
a system of interacting spins and although these will be distinguished by super¬ 
scripts in this section the reader should note that in most books no attempt is 
made to distinguish between them. The response of the magnetisation given by 
equation 10.98 to an external field is clearly shape-dependent so the suscepti¬ 
bility of unit volume of a given sample may be written 


v sample 
^int 



Xni _ 

1 - (7L - 7d) Xni 


(10.99) 


The sample susceptibility of a sphere is therefore equal to that of the non-inter¬ 
acting spins and in the region in which Curie’s law applies 


Xk 


sphere _ _ 


■int 


( 10 . 100 ) 


an important result which will be discussed in section 10.6.4. 

The susceptibility of the material , independent of the shape of the sample 
however is given by 


X k 


material - 


int 


ial _ ( d^int \ _ 

v dH { ) T ~ r 


Xni 


7l Xni 


7l C 


( 10 . 101 ) 


where H x is given by equation 10.96. 
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The value of Xni is always so small for nuclear paramagnets under normal 
experimental conditions that equations 10.101 and 10.99 are equivalent to 
equation 10.94. However an ingenious experiment by Abragam and coworkers 
has recently shown that if the spin temperature (section 10.5) is reduced to 
about 1 juK then the nuclear spins do form a ferromagnetic (or other ordered) 
system. 

As has already been remarked the most important interaction between 
electron spins is not normally the dipolar coupling, so although equation 10.101 
is of the form of the experimental Curie - Weiss law 

C 

X =- (10.102) 

T Q 

where 6 is a constant for a given material, the paramagnetic Curie temperature, 
the value of 0 is usually much larger than the value of 0.1 (n s /n )K calculated 
from equation 10.101. Only for the most dilute electronic paramagnets such as 
CMN is equation 10.101 useful. 


10.6.3 Thermodynamics of magnetic systems 

The second law of thermodynamics was written in chapter 3 as 

TdS = dU + 2 Y r dy r 

r 

where Y r is an intensive variable for example pressure or tension and y r the 
conjugate extensive variable (volume, length) such that Y r d y r is a work term. 
In many cases only one work term was found to be of importance and the 
equation 


TdS = dU + Ydy 

was sufficient to describe a simple system in which the state of the system was 
defined by just two independent parameters. 

The application of a magnetic field to a solid usually leads to only a small 
change in the energy of the system relative to the lattice energy so the volume 
change due to the magnetisation can be neglected. (This is not true near the 
Curie point of a ferromagnetic material but this point will not be considered 
here.) The magnetisation of a paramagnetic (or diamagnetic) substance is a single 
valued function of the external magnetic field so the system is a simple system 
as defined in chapter 2. 

A number of different expressions may be written for the magnet work term, 
depending upon the definition of the system of interest. Only the simplest, and 
most important, case will be considered here; a fuller discussion is given in 
Kittel (1969) or Robinson (1973). 

The force of attraction between a small ellipsoid with induced magnetic 
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C=> ~^ 


Figure 10.26 A possible system for performing reversible work on a sample in the field of a 
permanent magnet. The system of interest consists only of the sample. 


moment Jf{x) at a point x and a permanent magnet may be written 


F = Mo ^00 * 


bH *(x) 
dx 


where n 0 H*(x) is the field due to the magnet in the absence of the sample. 

One possible method of bringing the sample from infinity to a point x 0 by a 
quasi-static process is shown in figure 10.26. The work done on the sample is 


W A = 


Mo 


i 


dx 


dx = - Mo r 

Jo 


H 0 H*(x 0 ) 


m* 


Therefore 


6W A = - Mo (10.103) 

and the Helmholtz free energy of the magnetic part of the system may be 
written 


dF A = - S dT + tfH' 

= - SdT - /io ^dH* (10.104) 

where the component of jft parallel to H* is to be used in equation 10.104. It 
is shown in Kittel (1969) and Robinson (1973) that this form for the Helmholtz 
free energy is related to the partition function by the usual expression 

F a = - kT InZ 


Therefore 
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in agreement with the special result already obtained for a simple paramagnetic 
substance. The effect of interactions between the spins must therefore be 
included in the Hamiltonian of the system and hence in the partition function. 

The magnetic analogues of the Maxwell relations derived in section 4.2 may 
be established from equation 10.104. The most useful are 


\M*} T \bT )h* 


(10.106) 


derived directly from equation 10.104 and since 


/dS\ _/as_\ 

\bH*) T ~ \bMj t \ bH 

Kajeir \*tI 


(10.107) 


The application of these two Maxwell relations to the ideal paramagnet with 
the equation of state 


will now be considered. According to the third law of thermodynamics, the left- 
hand side of equation 10.106 must go to zero as T-+ 0 K , so an ideal paramagnet 
cannot exist in this limit, in agreement with the special result obtained in equa¬ 
tion 10.92. Secondly, the change in the magnetic entropy of an ideal paramagnet 
is a function only of the magnetisation since 


■i (f).- 


Mi 2 ) (10.108) 


Two final results, valid for any paramagnet, will be derived in this section. 
Firstly, a reversible increase in the magnetisation of a paramagnetic material at 
constant temperature leads to a flow of heat out of the material. Considering 
the entropy as a function of the magnetisation and the temperature 


T r 95 ) AT + T 
\bT Iff 




so at constant temperature 


TAS = AQ = T 


(•*-) 


AM 




A M (10.109) 
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( dH*/dT)^f is always positive for a paramagnet so heat always flows out of the 
system when the magnetisation is increased at constant temperature. 

Secondly, an isentropic decrease of the external magnetic field always leads to 
a decrease in the temperature of a paramagnet. Considering the entropy as a 
function of temperature and magnetic field 


TdS 




d H* 

T 


= C// dr + HoT 



I d H* 

H* 


where C H is the heat capacity in constant external magnetic field. The tempera¬ 
ture change due to a small isentropic change in the magnetic field is 


- T 


AT = 


\ 

, 'dT ) 
Ch 


Mo AH* 
H* 


( 10 . 110 ) 


(3 !<$T)h* is always negative at temperatures above absolute zero so the 
temperature of the sample will tend to decrease when the magnetic field is 
reduced. 

These two results are important in the theory of adiabatic demagnetisation 
which is discussed in section 10.6.5. 


10.6.4 The thermodynamic temperature scale at low temperature 

The thermodynamic temperature scale was seen in section 3.1 to be identical to 
the perfect gas scale. At normal temperature it is therefore possible to establish 
the thermodynamic scale by using a gas thermometer and applying a simple 
correction, such as the virial coefficient B v in equation 2.10, to the equation of 
state to allow for interactions between the molecules. It is possible to extend 
the scale down to about 1 K by measuring the saturated vapour pressure of 4 He 
and to 0.3 K using the rare isotope 3 He (section 10.1). 

In section 10.4 however it was seen that 3 He could be cooled to 2 x 10~ 3 K 
(2 mK) using Pomeranchuk cooling and a similar temperature can be reached by 
the technique of adiabatic demagnetisation discussed in the next section. The 
measurement of temperature in this region can be carried out most simply by 
measuring the susceptibility of either a nuclear paramagnet or a highly dilute 
electronic paramagnet. The susceptibility of a sphere of CMN, for example is 
found to follow Curie’s law down to the mK region as would be expected from 
equation 10.101 if only dipolar interactions are important. 
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A magnetic temperature T* is commonly defined by the equation 

( 10 . 111 ) 

where Xj nt sphere is the measured susceptibility of the sphere and by equation 
10.100, T* is identical to T while Curie’s law holds. At sufficiently low tem¬ 
perature it has already been remarked that Curie’s law must break down but it is 
still possible to correct T* to T if the specific heat of the substance has been 
measured (see Zemansky 1968). 

The use of a sphere of CMN has become standard in the very low-temperature 
region and the correction to T* is only about five per cent at a temperature of 
5 mK rising to thirty-three per cent near 3 mK. At still lower temperatures it is 
possible to use nuclear magnetism as an accurate thermometer since a nuclear 
paramagnet has been seen to follow Curie’s law down to the region below 0.1 
mK. 

The weak nuclear paramagnetism can be separated from other magnetic 
effects in the solid by the method of nuclear magnetic resonance. The sample is 
placed in a uniform magnetic field iiqH* and an alternating radio frequency field 
of frequency v , with its magnetic vector at right angles to ju 0 H* is absorbed 
when the resonance condition 

hv 0 = 2j uhqH* = A 
is satisfied (see section 10.5). 

The strength of the resonance signal depends upon the difference 
occupations of the two levels (for spin as was discussed in section 

/1 _ e - 0 A \ 

Signal = constant x N s I-) 

Vl + e”0 A / 

constant 

=- (j8A ^ 

T 

provides a direct measurement of the thermodynamic temperature. 

10.6.5 Adiabatic demagnetisation 

The attainment of temperatures in the mK region by the method of Pomeranchuk 
cooling was discussed in section 10.4. At such low temperatures it becomes 
difficult to establish thermal equilibrium between different parts of a system and 
the disadvantage of Pomeranchuk cooling was seen to be that solid 3 He forms 
in the warmest part of the system and therefore tends to isolate the sample from 
the liquid 3 He. Pomeranchuk cooling has therefore been largely restricted to the 
study of the properties of 3 He in the mK region since the sample then ‘cools 
itself. 


in the 
10.5 so 

( 10 . 112 ) 

1 ) 
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The other important method of reaching the mK region was developed before 
Pomeranchuk cooling and is called, rather misleadingly, adiabatic demagnetisa¬ 
tion. The principle, as opposed to the practice, of the method is straightforward. 
The entropy of a system of particles with spin-| is simply given by the result for 
the two-level system, equation 7.46 

S s = N s k £ln (1 + e~ 2 *) + J (10.113) 

where 

moH Xoc 

X = 

kT 


A reversible adiabatic (isentropic) decrease of the external magnetic field from 
p 0 H i* to iiqH 2 * must take place at constant x and leads to a decrease in the 
spin temperature from T x to T 2 given by 


H x loc _ H 2 loc 
T x ~ T 2 


(10.114) 


where H Xoc is related to H* by equation 10.97. This equation may also be 
derived from thermodynamics, exercise 10.9. 

An external field as large as 5 T is now commonly available using a super¬ 
conducting magnet, and reducing the current in the solenoid to zero would 
leave only the small field due to dipole - dipole interactions, say 0.01 T in a 
dilute electronic paramagnet, so the spin temperature could be reduced by a 
factor of 500 using this technique. The reduction factor would be even greater 
in a nuclear paramagnet where the dipolar field is only about 10“ 4 T. 

The reason that the term ‘adiabatic demagnetisation’ is rather misleading is 
that for an ideal paramagnet the magnetisation of the system is constant in this 
process if HqH 2 * is not reduced to zero since the magnetisation is a function of 
H/T (equation 10.93). 

Equation 10.114 relates the initial and final spin temperatures of the system 
but is only correct for the final temperature of the whole system of lattice and 
spins if the entropy of the lattice is negligible. In general 


AS s + AS) = 0 (10.115) 

for a reversible adiabatic decrease in the magnetic field, where S x is the entropy 
of the lattice. 

In an insulator at low temperature, from section 10.3 
Cy pT\ A 

AS, = J r — dT = A I r 2 dr = (r , 3 - r, 3 ) (10.116) 

1 T ‘'m 3 
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x 


Figure 10.27 The entropy of a simple paramagnet as a function of x = When the 

entropy of the lattice is negligible the entropy of the spin system is unchanged after 
adiabatic demagnetisation and equation 10.114 applies. When the entropy of the lattice 
cannot be ignored the spin entropy must increase to balance the decrease in lattice 
entropy at low temperature. (See exercise 10.8.) 


where 7\ is the initial temperature of spins and lattice and 7] is the final lattice 
temperature. The decrease in the lattice entropy must be balanced by increasing 
the spin temperature from the value T 2 (calculated using equation 10.114) to 7J. 

Although it is straightforward to write equation 10.115 explicitly using 
equations 10.113 and 10.116 the important condition for the difference 
(T\ - T 2 )/T 2 to be small can be seen at once from the diagram for the spin 
entropy, figure 10.27. Unless the temperature can be lowered sufficiently for 
AS) to be negligible relative to A^ the spin system must initially be prepared on 
the steepest part of the spin entropy curve. A large change in spin entropy then 
leads to only a small change in temperature. 

The condition is therefore 

* kT x 

where the difference between the external and local fields has been neglected. 

A liquid helium cryostat using 3 He may be used to cool the paramagnet to 
0.3 K in a field of 5 T but even in this most favourable case the value of x is 
only « 10 -2 for a nuclear paramagnet, so direct cooling from liquid helium 
temperatures to the mK range is not possible using nuclear demagnetisation. In 
an electronic paramagnet such as CMN however it is simple to make x ^ l and in 
principle to obtain cooling to a few mK. A schematic diagram of the technique 
is shown in figure 10.28 and a numerical example will be found in exercise 10.8. 
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Pump Pump 



Figure 10.28 (a) The electronic paramagnet (E) is magnetised at constant temperature 
(« 1 K). (b) The heat leak to the helium bath is broken and the external field reduced 
slowly to zero. The temperature of the system falls to a few mK. (c) The nuclear 
paramagnet (N) is now magnetised. The temperature of the combined magnetic system 
therefore increases slightly as may be seen from equation 10.109. (d) Finally the 
nuclear paramagnet is isolated and demagnetised and the temperature falls to the juK 
region. 

The experimental difficulties associated with the method are related to the small 
value of the heat capacity at such low temperature. A minute heat leak into the 
system or the smallest mechanical vibration will prevent the attainment of the 
temperature expected on the basis of equation 10.109. 
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A further development of the method of adiabatic demagnetisation has led to 
the lowest lattice temperatures yet attained. The idea is as simple as before but 
the experimental problems are far more formidable. In the technique of adia¬ 
batic double demagnetisation (figure 10.28) an electronic paramagnet is first 
demagnetised and allowed to cool a nuclear paramagnet. The value of x in 
equation 10.113 for the nuclear paramagnet might then become as large as 0.1 
for a field of 5 T and a temperature of about 0.01 K. The thermal contact 
between the two paramagnets is now broken and the external field around the 
nuclear paramagnet reduced to zero. The nuclear spins are now in a local field 
of only about 10~ 4 T due to dipolar interactions and the final temperature may 
be as low as 1 juK. The temperature is measured using the susceptibility of the 
nuclear system. 

A good account of the method of double demagnetisation will be found in 
Zemansky (1968). It should be noted that in practical calculations for electronic 
paramagnets such as CMN the simple spin-^ model is inadequate since the Cr 3+ 
ion has non-zero angular momentum (J = 5/2) and the energy levels of the ion 
are affected by local electric fields in the crystal as well as by magnetic fields. 


EXERCISES 

10.1 Find expressions for the change in temperature of a slightly degenerate gas 
of fermions or bosons after a Joule - Kelvin expansion and explain the result. 
(Hint, use PV = A +B P P + . . . and exercise 2.2.) 

10.2 Find the pressure exerted by a gas of fermions at absolute zero at the same 
density as conduction electrons in silver. 

10.3 A cavity contains a monatomic gas at a pressure of 1 atmosphere and 
black-body radiation. Show that at room temperature the radiation makes a 
negligible contribution to the heat capacity of the system at constant volume. 

10.4 Find an expression for the energy density of black-body radiation as a 
function of wavelength and temperature [u(\, T )]. Show X m T is a constant and 
that X m v m « 3c/5 where p m is the peak of the u(v , T ) distribution. 

10.5 Show that the lattice specific heat of an n- dimensional system is propor¬ 
tional to T" at low temperature. 

10.6 The dispersion relation for ferromagnetic excitations (spin waves or 
magnons) has the form v a k 2 for small k. Show that the three-dimensional 
magnon specific heat is proportional to T 3 ^ 2 at low temperature. 

10.7 The spin wave dispersion relation in an antiferromagnet has the form 
pock. What is the low-temperature dependence of the magnon specific heat? 
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10.8 The lattice heat capacity of a solid of N particles at low temperature may 
be written 


Cj = aNkT 3 

where a « 10~ 5 K" 3 . Find the temperature at which the lattice and spin 
entropies are equal in a magnetic field such that x in equation 10.113 is equal to 
one if N/N s is equal to 60, where N s is the number of particles with spin-|. 

(bSjdx) = - 0.2 N s k near x = 1. Show that the final temperature of the 
whole system is twice that predicted by equation 10.113 if the original lattice 
temperature is 10 K but that the lattice contribution is negligible for an original 
temperature of 1 K. 

10.9 Show, using thermodynamics only, that C H = n 0 CH* 2 /T 2 for unit volume 
of an ideal paramagnet. Hence show that if other contributions to the heat 
capacity can be neglected: H 2 */T 2 = H x *jT x for an isentropic process. (Hint: 
constant entropy is equivalent to constant magnetisation in this case.) 



11. Conclusion 


The student who has reached this point in the book and has studied the exercises 
should now have some idea of the methods and range of application of thermal 
physics. A number of difficulties were, however, evaded in this book in the 
presentation of the foundations of thermodynamics and statistical mechanics 
partly in the interest of brevity and partly because these are best left until after 
the student has some familiarity with the subject. A number of books for further 
reading are listed after the appendixes but it may also be useful to discuss briefly 
some important topics which were not covered, either because of lack of space 
or because they were too difficult for an introductory text. 

The extension of thermodynamics to systems with a variable number of 
particles (as in chemical processes) is not difficult and is discussed in Zemansky 
(1968) and in many text books on chemical thermodynamics. The grand 
canonical ensemble was seen in chapter 7 to be the appropriate ensemble for a 
system in thermal equilibrium but with a variable number of particles. In a 
large system the mean number of particles N is well defined and the grand 
canonical ensemble may then also be used_to calculate the properties of a system 
with a constant number of particles (N = N). There are often computational 
advantages in using the grand canonical ensemble rather than the canonical 
ensemble as may be seen in Kittel (1969) or Huang (1963). 

The importance of fluctuations about equilibrium values in the canonical 
ensemble has been discussed several times in this book and particularly in 
connection with critical phenomena. A connection between the fluctuations of a 
system about equilibrium and its transport properties was first established by 
Einstein while studying the theory of Brownian motion. A particularly good 
discussion of this topic is to be found in chapter 15 of Reif (1965). The general 
relation between the equilibrium and transport properties is discussed in chapter 
22 of Wannier (1966). 

Classical thermodynamics has been seen to be restricted to systems in thermal 
equilibrium but a theory of non-equilibrium systems has also now been con¬ 
structed. The theory of the transport properties of gases when the deviation 
from equilibrium was small was considered in terms of the kinetic theory in 
chapter 9. The transport coefficients in this region are independent of the 
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driving force, as in Ohm’s law for example. The general thermodynamic theory 
of systems near equilibrium is discussed in a book by de Groot and Mazur 
(1962). A much more difficult subject which has been studied recently is the 
thermodynamics of systems far from equilibrium where the linear transport 
theory does not apply. A recent book by Glansdorf and Prigorgine (1971) 
describes the progress in this field which has applications to many branches of 
science and possibly to the problem of evolution. 

One of the main interests of current research in statistical mechanics is in the 
behaviour of strongly interacting systems such as dense gases, liquids, and mag¬ 
netic systems. The partition function of such systems can usually only be 
evaluated by approximate methods which require advanced mathematical 
techniques. A few models of strongly interacting systems can be solved exactly 
however as discussed in Huang (1963) and Wannier (1966). 
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FUNCTIONS OF TWO OR MORE VARIABLES 


When three variables are related by an equation of the form 
z = f(x,y) 

the total differential of z is defined to be 


dz 


3z\ 

— ) dx + 

dx ) y 



(A 1.1) 


where the suffix v in the first term on the right-hand side of the equation 
means that the differentiation with respect to jc is to be performed with y 
treated as a constant. A term such as ( dz/dx) y is called a partial differential. The 
equation could equally well be written in terms of x oxy and for x would read 



A number of relations exist between the partial differentials and the use of 
these relations forms an essential part of thermodynamics. Consider first the 
partial differential ( dx/dz) y . At constant y the second term in equation Al.l 
is zero so 


1 

9z )y (dz/dx) y 


The differential oiy with respect to x at constant z leads to 



(A 1.2) 
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which may be written 



(A1.3) 


The differential of z with respect to say x when some other variable 0 is held 
constant leads to the equation 



(A 1.4) 


Finally, the order of differentiation is not important 

d 2 z _ d 2 z 
dx 3 y by bx 

As an example consider 


(A 1.5) 


Z = x 2 y + xy 3 



x 2 + 3y 2 x 


d 2 z 
dx by 


2x + 3 y 2 


Alternatively 



= 2xy + y 3 


b 2 z 
by bx 


2x + 3 y 2 


b 2 z 
bx by 


An expression of the form 

A dx + B dy (A1.6) 

where A and B are functions of x and y is called a perfect (or exact) differential 
if some function z exists such that 


dz = A dx + B dy 


(A1.7) 
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The test for a perfect differential is that 


( dA ) = (A1.8; 

\dy/x \dx )y 

since in this case A must be equal to Cdz/dx) y and the equality in equation A1.8 
follows from equation A1.5. 

When the condition given by equation A1.8 is not satisfied, equation A1.6 
is termed an imperfect differential. It may be shown that an integrating factor 
(X) always exists for a function of two variables such that 

Xv4 dx + XB dy 


is a perfect differential (where X is some function of jc and y) but for a function 
of more than two variables an integrating factor does not exist in general. The 
second law of thermodynamics expressed (in part) by the form ‘an integrating 
factor always exists for the imperfect differential form of the first law of 
thermodynamics’ is therefore a law of physics, not of mathematics. 

As an example consider the expression 


diL = xy 2 djc + (x 2 y + xy) d y 
where cf means that cfZ, is an imperfect differential since 


However 


*). ' ^ SI 


(2x + 1 )y 


dz = = y dx + (x + 1) dy 

xy 

is a perfect differential. In this case the integrating factor X and function z are 
given by 


X = (xy) 1 ; z = (x + 1)^ 

The integrating factor is not however unique since the combination 
X* = 2(x + 1 )/x; z* = z 2 

is also satisfactory. Notice that a suitable choice of z* has led in this case to X* 
being a function of only one variable. 

The fact that two functions (z, X) are available explains why an integrating 
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factor always exists for a function of two variables but for functions of three or 
more variables an integrating factor exists only for a restricted class of functions. 

In the case of the differential form of the second law of thermodynamics 
discussed in chapter 3 

dS = X CTQr 

The choice of the entropy ( S ) as a extensive quantity makes X a function of only 
one variable, the temperature. 

A fuller account of the relation between the mathematical and physical state¬ 
ments of Caratheodory than that given in chapter 3 will be found in the book by 
Buchdahl (1966) quoted in the reading list. 



Appendix II 


USEFUL MATHEMATICS 

Many of the integrals which occur in statistical physics may be reduced to 


standard forms. The gamma function is defined to be 

r(«) = f e”**" -1 dx (n > - 1) (A2.1) 

J 0 

Notice that the power of x is (n - 1) for r(«). The following relationship exists 
between the gamma functions 

T(n) = (« - l)r(« - 1) (A2.2) 

and in particular 

T(l) = 1 (A2.3) 

r(|) = \Ar (A2.4) 

An integral of the form 

I(n) = f e~ axi x n dx (n > 0) (A2.5) 

J 0 

where a is a constant may be reduced to the form 

m = jr ” (A2.6) 

An integral of the form 

x n dx = 2/(«) (n even) (A2.7) 

= 0 (n odd) (A2.8) 
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The Riemann zeta function 

?( z ) = £ V (z > 
which has the values 

2 4 

?(!) = 2.61 f(2) = ?- f( 4 ) = " 0 

is useful for the evaluation of the integral 

r?.I^ = tW r W (A2.9) 

Jo Q X — 1 

Stirling’s approximation 

ln«! = n\nn - n + \ln(2ivi) (n.> 1) 

where 

n\ = n(n - 1) (n - 2) ... 3 x 2 x 1 

is accurate to better than one per cent for n greater than ten. When n is very 
large the last term is negligible and the form 

hull = n Inn - n (A2.10) 


is adequate (chapter 6). 
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LAGRANGE UNDETERMINED MULTIPLIERS 

The total differential of a function of two variables (Appendix I) may be written 



The condition for a maximum value of z is not only that dz is zero but that each 
term is zero. 



(A3.1) 


since x and y are independent variables. 

However when a maximum value of z subject to a constraint of the form 

f(x,y) = constant (A3.2) 

is required, x and y are no longer independent since 



(dz/dx) y = (dz/dy) x 
(df/dx) y (df/dy) x 

The equations may now be written in terms of the (unknown) constant (X) 
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These equations would result if the function 
<l>(x,y) = z - \f(x,y) 


were maximised without a constraint, as in equation A3.1, so the effect of the 
constant X (the Lagrange undetermined multiplier) is to decouple the equations. 
The complete solution of the maximum, subject to the restraint, has of course 
only been postponed since X is unknown but Xmay be carried to the end of the 
problem and then adjusted to give the correct value of the constant in equation 
A3.2. 

The extension of the method when z is a function of n independent variables 
subject to m constraints (m < n) is straightforward since one undetermined 
multiplier is simply introduced for each constraint and the terms become of the 
form 


(bz" 





• 1! ”) .0 

ix) ,• 


where the suffix x' means that all variables apart from x are to be held constant. 



Appendix IV 


DENSITY OF SINGLE-PARTICLE STATES 


The allowed solutions of any wave equation depend upon the boundary condi¬ 
tions imposed on the wave. The most familiar example is perhaps that of a 
stretched wire held at both ends. The only standing transverse waves which can 
be excited in the wire are those which have zero amplitude at both ends. This 
restriction means that the wavelengths of the standing waves must be given by 
an equation of the form 

n-=L (n = 1,2,3,...) (A4.1) 

2 


where L is the distance between the ends of the wire. 
The wave vector (fc) is defined by 

k = ^- 
X 

so the allowed wave vectors are given by 


(A4.2) 


* = - («= 1,2,3,...) 

L 


(A4.3) 


The allowed wave vectors may be represented by a set of points spaced at 
intervals of n/L. The space so defined (in this one-dimensional case simply a line) 
is called k-space and has the dimensions of inverse length. 

The concept of discrete allowed solutions to a wave equation is not therefore 
in any way restricted to quantum mechanics but became of greatly increased 
importance after the introduction of the Schrodinger wave equation. The case of 
a particle in a box will now be considered in detail and then the general results 
for different kinds of system will be discussed. 
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In one dimension the Schrodinger equation for a free particle of mass m may 
be written 


d 2 \p Sn 2 me 

— + \b = 0 

djc 2 h 2 


The solution to this equation is of the form 
i// = Ae ik * x 


h 2 k 2 
87 T 2 m 


Px 

2m 


Px = 


27r 


(A4.4) 


(A4.5) 

(A4.6) 

(A4.7) 


where A is a normalising constant, e the energy of the particle, and p x the 
momentum in the x-direction. The last equation is simply the de Broglie relation¬ 
ship between the momentum and wavelength of a particle. 

When the particle is free to move only within the confines of a box (or in the 
present one-dimensional case along a line of length L ) the wave vector must 
again be restricted to those values which satisfy the boundary conditions. The 
boundary conditions are less obvious than for the case of the stretched string 
but when the box is large the final results for the macroscopic properties of the 
system cannot depend upon the exact size of the box or the choice of boundary 
condition. One possible boundary condition is that the wave function vanish at 
the surface of the box. This is equivalent to the standing wave solution of 
equation A4.1. 

An alternative boundary condition, which will be used throughout the book, 
is found by joining the two ends of the length L and considering running waves 
around the loop. This may be expressed 

\p(x) = \j/(x + L) (A4.8) 

and is known as a periodic boundary condition. The allowed wavelengths are 
now 


nk = L (n = 0,± 1,±2,± . . .) (A4.9) 

k x =- (n = 0, ± 1, ± 2,. . .) (A4.10) 

L 


The allowed energies and momenta of the particle may now be found by sub¬ 
stituting for k x in equations A4.6 and A4.7. The energy levels for periodic 
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boundary conditions in one dimension are all (except for the ground state in 
which n equals zero) two-fold degenerate since the states with ± n have the same 
energy. 

When L is large the allowed values of k (the microstates, or states of the 
system, since they represent the solutions of the Schrodinger equation) are 
sufficiently close together to be represented by a continuous function called the 
density of states. The exact meaning of ‘sufficiently close’ will be discussed after 
the results for three dimensions have been obtained. 

The number of allowed states per unit length of the line in k-space is, from 
equation A4.10, equal to L/2n . 

The density of states function in one dimension is defined as 

D(k) d k = number of states in the range k to k + d k 


2n 


When the particle is restricted to a two-dimensional box of side L the allowed 
states for periodic boundary conditions are given by 


k x = x (n x = 0,±1,±2±...) 

Ls 

k y = ^ ?r -“ (n y = 0, ± 1, ± 2 ± . ..) 
L 


and the allowed energy levels by 


h 2 ( k x 2 + ky 2 ) 
&7i 2 m 


h 2 

2 mL 2 


(n x 2 + n y 2 ) 


(A4.12) 

(A4.13) 

(A4.14) 


The energy levels above the ground state can be more degenerate than in one 
dimension because the same energy can be obtained, for example, with either n x 
equal to ± 2 and n y equal to ± 1 or vice versa. 

In two (and three) dimensions the term ‘density of states’ is commonly used 
for two rather different quantities and this sometimes leads to confusion. The 
allowed states in two dimensions form a square array in k-space with side 2ti\L 
(figure A4.1). 

The number of states in a small element of area d 2 A: is 

(A4.15) 

where the symbol d 2 k is used to stress that A: is a vector quantity. The density of 
states function in this sense is a constant thoughout all the k-space as may be 
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Figure A4.1 The allowed single - particle states in a two-dimensional box of side L form a 
square array in k-space with spacing 2ir/L . The allowed states with wave vector of 
magnitude between k and k + dk lie in the shaded area. 


seen from figure A4.1. 

The alternative use of the term ‘density of states’ refers to the magnitude of 
k (a scalar quantity). D(k)dk is the number of states with wave vectors of mag¬ 
nitude between k and k + d k. The wave vectors of constant magnitude k lie on a 
circle about the origin in k-space (figure A4.1). The area between vectors of 
magnitude k and k + dk is simply 2rrk dk and therefore 

/ / \ 2 T 2 

D(k) dk = - 2nk dk = — k dk (A4.16) 

\27Jv 2n 

When scalar quantities (such as energy) are under consideration it is this density 
of states function which is of importance. The relationship between equation 
A4.15 and equation A4.16 may be clarified if the area element of d 2 k is written 
in polar coordinates. Then equation A4.15 becomes 

^ 2 k dk dip (A4.17) 

The scalar density of states therefore results from integrating equation A4.17 
over the whole angular range 0 to 2ir at constant wave vector. 
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The extension of these concepts to three dimensions is straightforward. The 
periodic boundary conditions may still be used although in fact the opposite 
faces of a cube can only be joined together in a four-dimensional space. The 
energy of the particle is given by the obvious generalisation of equation A4.14 


h 2 (k x 2 + k y 2 + k z 2 ) _ h 2 n 2 
Sn 2 m 2 mL 2 

n 2 = n x 2 + n y 2 + n 2 


(A4.18) 

(A4.19) 


The energy e n depends only upon n 2 , but the degeneracy of an energy level 
depends upon the number of ways n 2 can be formed from values of n x , n y and 
n z in equation A4.18. 

The allowed states form a cubic array in the three-dimensional k-space with 
spacing 2n/L. The density of states may be written either as 


L 

2tt 


3 

d 3 k 


(A4.20) 


if the vector quantity is under consideration (where d 3 k may be written in any 
convenient form such as 


d 3 k = d k x d k y d k z (A4.21) 

in cartesian coordinates or 


d 3 k = k 2 d k sin 6 dd d<j> (A4.22) 

in polar coordinates) or as the scalar quantity defined in equation A4.11. 

The volume enclosed between the spheres of radius k and k + dk is 
4nk 2 dk so the scalar density of states in three dimensions is 

D(k) dk = (A) 3 4nk 2 dk = -- k 2 dk (A4.23) 

\2nJ 2 ir 2 


which, as in the two-dimensional case, could have been obtained by integrating 
over the angular part of equation A4.20 at constant k. 

Although equation A4.23 was derived for the specific case of a cube it is in 
fact correct for any large volume V. The density of states function given by 
equation A4.23 is also obtained when the standing-wave boundary conditions 
(equation A4.3) are used. In this case the points are twice as close together in 
k-space but the allowed values are restricted to one octant of the sphere (since 
all the components of k must be positive). 

The physical meaning of the density of states function may be made apparent 
by considering the states in a space of n dimensions of momentum and n of 
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position where n is 1,2 or 3. This space is called a phase space. The wave vector 
and the momentum of the particle are related by equation A4.7 so equation 
A4.20 may be written 

3 

d 3 p (A4.24) 

or generalising the volume term 



d 3 p d 3 r 
h 3 


(A4.25) 


The meaning of equation A4.25 may be expressed in words as ‘a single-particle 
quantum state occupies a volume h n in a phase space of 2 n dimensions!. 

This fundamental result is obviously a feature of the Heisenberg uncertainty 
principle and illustrates the difference between classical physics (which places 
no restriction on the spacing of the states) and quantum physics. 

The most general form of the density of states function for a particle is 
therefore given by equation A4.25. When only the scalar density of states is 
required and gravitational effects are neglected this becomes (from equation 
A4.23) 

4nVp 2 

D{p) dp =-— dp (A4.26) 

h 3 

using the mathematical relationship 

Dip) dp = Dik) &P (A4.27) 

where the meaning of Dip) is 

Dip)dp = Number of states with magnitude of momentum 

between 


p and p + dp (A4.28) 

The density of states function can also be expressed as a function of the 
energy of the particle provided that the relationship between the energy and the 
momentum is known. In Newtonian mechanics the energy and momentum are 
related by the equation 


e = \ mv 2 


2m 


(A4.29) 


The density of states as a function of energy may then be written (using 
equation A4.26) 
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D(e) de = 2 me ^ de (A4.30) 

= 2jrFQ^ 2 eide (A4.31) 

A particle in the extreme relatavistic limit however has the energy momentum 
relation 


e = cp (A4.32) 

where c is the velocity of light. It is still permissible to use equation A4.23 or 
equation A4.26 (although equation A4.30 does not apply) and therefore 

477' V 

D(e) de =-- e 2 de (A4.33) 

(he) 3 

Notice that this equation cannot be obtained from equation A4.30. The basic 
equations are A4.23 and A4.26. 

When the particle has spin (s) each energy level, in the absence of a magnetic 
field, has an increased degeneracy since (2s + 1) states have the same kinetic 
energy. The density of states function, such as equation A4.31, must then be 
multiplied by (2s + 1). An electron for example has s = 1/2, (2s + 1) = 2. The 
form of equation A4.31 is correct for one direction of spin or for a particle with 
zero spin. The degeneracy factor will be written g where g is equal to (2s + 1) 
for particles with spin s, to 2 for electromagnetic waves and to 3 for lattice 
waves, as will be seen below. 

The results of this section for the Schrodinger wave equation of a particle can 
be carried over to other wave equations using the basic result of equation A4.23. 
An electromagnetic wave in a cavity for example (section 10.2) has the same 
energy density of states as an electron in the extreme relativistic limit which is 
apparent from treating the electromagnetic field in terms of a photon gas since 
photons obey equation A4.32. The degeneracy factor is again 2 due to the two 
directions of polarisation of the wave. 

A more complicated situation arises in the theory of the lattice vibrations of 
a solid (section 10.3). The total number of allowed modes of the system is now 
limited to 3N for N atoms in a three-dimensional crystal. The exact treatment of 
this problem will not be considered here but in the extremely useful Debye 
approximation the density of states may be found immediately from equation 
A4.23. In the Debye approximation the N atoms of the crystal are considered to 
form a continuum for which the frequency (v) and wave vector are related by 


2nv 

C s = V A = " 
k 


(A4.34) 
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where c s is the velocity of sound in the crystal. The density of lattice vibrational 
states as a function of frequency is, from equation A4.23 

4nV 

D(p) dp = 3 x- v 2 dp (A4.35) 

C s 3 

but the number of states must be restricted to 37V so there is a maximum value 
of the frequency (V D ) given by 


Therefore 



D(p)dp = 37V 




3 Nc*\l 
4nV ) 


(A4.36) 


(A4.37) 


The density of states in equation A4.35 includes the three possible degrees of 
polarisation of the lattice vibrations. 

The density of states function was stated to be a good approximation to the 
actual discrete allowed energy states provided that the states were ‘sufficiently 
close together’. In the particular case of a particle in a box the meaning of 
‘sufficiently close’ may be illustrated in terms of equation A4.18. 

The mean kinetic energy of a single particle in a box at temperature T is 
3kTj2 (section 7.5). This defines a value of n 0 from equation A4.18 


n 0 


2 


2mL 2 e _ 3 mL 2 kT 
h 2 " h 2 ~ 


(A4.38) 


The separation of adjacent energy levels is 
h 2 

Ae = „ - ~ 2 (2«o - !) 
2 mL 2 

which may be written in temperature units 
equation 


h 2 n 0 
mL 2 


(A4.39) 


by defining a temperature T g by the 


Ae = kT g (A4.40) 

An electron in a box of side 10 -2 m at 300 K then has 

n 0 » 10 s T g » 10~ 5 K <T (A4.41) 
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so the separation between the energy levels is many orders of magnitude less than 
the thermal energy. 

The density of states function may always be used in the classical limit and 
also for the Fermi - Dirac gas but has to be modified for the ground state of a 
Bose - Einstein gas in the low-temperature limit (section 10.1). 
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Further Reading 


GENERAL TEXTS 

A number of books cover much the same material as this book in greater detail and 
at a more advanced level. The full titles are listed in alphabetical order by author 
in the reference section. 

Kittel (1969), Mandl (1971), Reif (1965) and Wannier (1966) all develop 
thermodynamics and statistical mechanics in parallel rather than (as in this book) 
as separate approaches to the study of systems in thermal equilibrium. Mandl 
(1971) is the least advanced of the four and is restricted to systems in thermal 
equilibrium. Kittel (1969) begins by discussing a model system in the same spirit 
as section 6.3. He then goes on to cover many interesting applications of statisti¬ 
cal mechanics using the grand canonical ensemble but the foundations of 
statistical mechanics are not considered in any detail. Reif (1965) is an extremely 
detailed text, to the earlier sections of which this book may serve as a guide. The 
final chapters on transport theory and fluctuations are outstandingly good. 

Wannier (1966) also covers both equilibrium and transport theory and contains 
some unusual examples as well as a careful discussion of the relaxation-time 
approach to the Boltzmann equation. 


THERMODYNAMICS 

Zemansky (1968) is not more advanced than this book but is useful for its very 
clear presentation and many tables and diagrams of experimental results. A good 
advanced text with an approach quite different to that adopted here is Callen 
(1960). The foundations of thermodynamics are discussed with great clarity in 
Buchdahl (1960). A full mathematical treatment of the Caratheodory approach 
to the second law of thermodynamics is given and comparison made with the 
earlier versions of the law due to Clausius and Kelvin. 
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STATISTICAL MECHANICS 

The classic text on the foundations of statistical mechanics is Tolman (1938 — 
reprinted many times). A good account is also given in Huang (1963). In addi¬ 
tion to the books listed under ‘general texts’ one other outstanding advanced 
book on statistical mechanics is that by Landau and Lifshitz (1959). 


TRANSPORT THEORY 

Reif (1965) and Wannier (1966) both contain good discussions of kinetic theory. 
A detailed review of the solution of the Boltzmann transport equation is given 
in Hirschfelder et al (1954) but some of the more advanced sections of this 
book on dense gases and liquids are now out of date. The obvious applications of 
kinetic theory to vacuum physics was remarked on in chapters 8 and 9. A useful 
introduction to ultra-high-vacuum techniques and interaction of the atoms of a 
gas with a surface is given in Robinson (1968). Ziman (1960) contains a good 
discussion of the use of the Boltzmann transport equation in the theory of 
solids. 


CHAPTER 10 

10.1 Kittel (1969) and Huang (1963) give good discussions of Fermi -Dirac and 
Bose - Einstein statistics. Kittel (1971) is an excellent introduction to the pro¬ 
perties of electrons in solids. The properties of liquid helium are discussed in 
Tilley and Tilley (1974). 

10.2 The black-body background radiation is discussed in a book by Sciama 
(1971) and in an article by Webster (1974). 

10.3 Kittel (1971) and Gopal (1966) contain theory and comparison with 
experiment. 

10.4 The theory of critical phenomena is covered in a recent book by Stanley 
(1971). Pomeranchuk cooling is discussed by Huiskamp and Lounasmaa (1973) 
and the superfluid properties of 3 He in the temperature range below 3 mK by 
Lounasmaa (1974). 

10.5 The thermodynamics of a system with a negative temperature was first 
discussed by Ramsey (1956). The concept of negative temperature is also used 
on books on masers and lasers, as for example Unger (1970). 

10.6 A good account of the thermodynamics of magnetic systems is given in 
Kittel (1969) and, at a more advanced level, in Robinson (1973). 



Solutions to Exercises 


2.1 The Celsius temperature on the resistance scale is defined 

e R = Re -^ R -° x 100 °c 

^100 “ Ro 

(since ice and steam points have the same value on all scales). At 70 °C on the 
gas scale 


R e = R 0 (1 + 70a + 70 2 © 


Therefore 


0 R = (70a + 70 2 j3)/(a + 100© = 72 °C 
2.2(a) At low density the equations must reduce to the perfect gas law. Now 
(PV)_> A V^ m ;(PVU A P^o 


A = RT 

(b), (c) 



Equating coefficients of 1/K 2 


Bp — By 
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Equating coefficients of 1 IV 3 

C v = B P 2 + AC P 


(d),(e) 


V - b V 2 



Equating coefficients oil IV 2 

By ~ b a/RT 
Equating coefficients of l/V 3 
Cy — b 2 


— (b < V) 
V 2 


2.3 ( dP/dV) T = 0 at critical point 
therefore 

P = a(V - 2 b)/V 3 

On differentiating again with respect to V and equating to zero 
V c = 3b 

substituting for V in the first equation then leads to 
P c = alllb 2 

and substituting the van der Waals equation with these values gives the other 
equations. 

2.4fa) C V 

W = - J y P dV = - P(V 2 ~ Vi) for P constant 


(the sign convention is such that W is positive if work is done on the gas, that is 
to say when V 2 is less than V x ). 


(0 

00 


W = - R(T 2 - T x ) 
W = - R(T 2 - T x ) 
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(iii) 

(iv) 


(A-i) 

- " <r * - ri) - ‘ 


In equation iv the term in ‘ a ’ represents the work done to overcome the attrac¬ 
tive force between the molecules. The term in V represents the repulsive forces 
Notice that for expansion at constant pressure the repulsive term is not impor¬ 
tant. 


2.4 (b) 

0) 


00 


(iii) 


(iv) 



P dV = 



dV 

V 


- RT In 



W = - RT In 



W = 

W = 


- RT In 


- flrin 


- rtrin 



= - ^rin 


Vi 

Vi 


Rb(T-T B ) 



v 2 


) 


Notice that the second equation of state led to the same work-term as a perfect 
gas in part (a) but not in part (b). Similarly the van der Waals gas would behave 
as a perfect gas at the Boyle temperature in part (b) but not in part (a). A real 
gas may give the same result as a perfect gas under special conditions, without 
obeying the perfect gas law. 


2.5 AU = AQ + AW; AW = - P(V 2 - V t ) 

AQ = C P (T 2 T,) = C P P(V 2 - V^/R 


AU = P(V 2 - Fj )l(y - 1) 

= 7.5 x 10 s J ( C P - C v = R) 


Therefore 
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2.6 



PdV = - 2RT\n2 = - 6.9 kJ 


Work done by gas = 6.9 kJ. Heat supplied also = 6.9 kJ because A U= 0 for a 
perfect gas at constant temperature. 


2.7 Along ab W = - RT X In 2; A U = 0 Therefore Q = RT X In 2 
Along be W = RTJ 2; Q = - C P T X \2 

Therefore A U = — CyTi/2 

Alongca W = 0;g = CV7\/2 Therefore A U = C V T X \2 


Therefore 

in cycle W = — 7^^ In 2 + RT x j2 < 0 
g = In 2 - RTJ2 > 0 


Only Uis a function of state since AU= 0. 

2.8 W = — P(V 2 where P is 1 atmosphere, V 2 the volume of 1 mole of 
vapour at the boiling point and V x the volume of 1 mole of liquid. V x is negligible 
relative to V 2 . 

Therefore W = - PV 2 = - RT b = - 313R = - 3.1 kJ 

The work done by the vapour is 3.1 kJ. The heat supplied was 41 kJ. Therefore 
38 kJ was used to overcome intermolecular forces and only 3 kJ to perform 
external work. 

2.9 cTh# = dl represents work done on the system. At constant tension 0 

W = ill - h) = ^oloc ( T 2 - T x ) 

where a is the mean coefficient of linear expansion in the temperature range 
T x to T 2 . 


2.10 If W were a function of state dW would be a perfect differential (appendix 
I) and dr+ ( 9 ^ 

JTjv \w)t 


dH/ 


dV 


but dW = 0 dr + PdV 

and condition for a perfect differential becomes (bP/dT)y = 0. W cannot there¬ 
fore in general be a function of state since the pressure and temperature of a 
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system are related by the equation of state. 


3.1(a) The energy equation (3.14) leads to the results 

(i) (dU/dV) T = 0, therefore U = f(T) 

(ii) As (i) 

(iii) (bU/bV) T = a/V 2 ,U( 2) - U( 1) = a (y- - yj + f(T) 

(iv) As (iii) 

In the case of the perfect gas, U = f(T) = C V T and since the other equations 
reduce to the perfect gas as V approaches infinity (low density) f(T) has the 
same form for the other equations. 

(b) C P - Cy = T(dP/dT) v (dV/bT) P 

(using equations 2.34 and 3.14) 

The results are therefore 

(0 R 

00 R 

(iii) R( 1 - la/RTV)- 1 « R{ 1 + 2 a/RTV) 

(iv) R[ 1 + 2a (V - b)/RTV 2 ] « R (1 + la/RTV) 

The repulsive term V has little effect on C P - C v . 

(c) Use equation 3.20 (d) Put dS = 0 in equation 3.15. 

Therefore C v dT/T= - (i bP/bT) v dV 


Assume Cy is independent of T 


(0 

XV R / Cv = constant 

(ii) 

T(V - b) 1 ^ Cv = constant 

(iii) 

Xv R /°v = constant [C P - Cy ^R, see section (b)] 

(iv) 

T(V - b^ Cv - constant [C P - Cy i^R, see section (b)] 

3.2 Exercise 2.5: 

dS = &Q r IT = C P d T/T AS = C P In (T 2 \T X ) = (5/?/2)ln 2 


Exercise 2.6: AU = 0 TdS=PdV_ AS = 2R ln(F a /Fi) = 2R In 2 
Exercise 2.7: Along ab AS =R In 2 

Along be AS = — C P In 2 
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Along ca AS = Cy In 2 

Total AS = 0 since Cp - C V =R for a perfect gas. 

Exercise 2.8: AS = Q/T b = 110 J K -1 

3.3 See figure 3.7 

(a) Heat absorbed along ab = work done -RT A \n(V b /V a ) 

Heat rejected along cd = RT B In (V c /V d ) 

We now have to relate the volumes using the equation for a reversible adiabatic 
change 

T A v b y~ l = t b v c y ~ l , t b v d y ~ l = t a v a y ~ l 

Therefore 


V b /V a = V c /V d 

Therefore using equation 3.48; r? = (T A - T B )/T A 

(b) &Q r =C v dT+T(dPldT) v dV=C v dT + RT(V-b)- 1 dV 
along ab Q A = RT A ln[(V b - b)/(V a - b )] 
along cd Q b =RT b ln[(V c - b)l(V d - b )] 

Now use result of exercise 3.1 (c) part 4. r? = (T A - T B )/T A . The efficiency of a 
reversible engine is independent of the working substance. 

3.4 See figure 3.8. The Stirling engine rejects heat reversibly along be and 
accepts the same amount of heat along da. The net heat transfer is therefore 
only along ab and cd 

Along ab: Q A =RT A ln(V b /V a ) 

Along cd: Q B -RT B ln(V c /V d ) 

Also V b = V c -,V a = V d 
Therefore ?? = ( T A - T B )/T A 

The work done in the Stirling cycle is ^(7^ - T B ) \n(V c /V d ) 

The work done in the Carnot cycle is R(T A - T B ) In ( V c \V d ) which is less than 
that of the Stirling cycle. 

3.5 The heat transferred in a reversible process is 

<2r=J TdS 

The area enclosed inside the rectangle (figure 3.6) representing the Carnot cycle 
is therefore equal to the net heat absorbed, just as the area inside a cycle on a 
P-V diagram is equal to the net work 

3.6 Isentropic change T 3 V = constant (see equation 10.27). Therefore 
PV 4 / 3 = constant along an adiabat and P = constant along an isotherm. 
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Therefore for the isothermal at T A : 

Qa =f(P + u)dV = 4 bT A \V„ - V a )/3. 
for the isothermal at T B : 

Qb = 4 bT B \V c - V d )/3 

V b T A 3 = V c T b 3 ; F d r B 3 = F^ 3 

Therefore 

r? = (T A - T b )/T a 

3.7(a) AS 1(ake) = Q/T = 500(600 - 300)/300 = 500 J K -1 

The entropy change of the block must be calculated by considering a reversible 
path from 600 K to 300 K. A series of thermal reservoirs at intermediate tem¬ 
perature would form a reversible path and the total change of entropy is 

J 3oo d!T 

— = - 500 In 2 J KT 1 

600 T 

The total entropy change is therefore + 155 J K -1 . 

(b) A Si = 500(373 - 300)/300 = 122 JKT 1 

AS wat er = 500(600 - 373)/373 = 305 J K“‘ 

A S b = - 500[ln(600/373) + ln(373/300)] 

= -346JK- 1 

The net entropy change is therefore + 81 J K -1 . Cooling in two steps has reduced 
the net change in entropy. A series of reservoirs would make the change reversible 
and hence the net entropy change would be zero. 

4.1 P = -(bU/bV) s = 2U/3V 

T = (bUldS)v = 2U/3Nk 

Therefore 

PV = NkT; U = 3NkT/2; C v = 3Nk/2; C P = 5Nk/2 
F = U - TS = (3NkT/2) |l - ln[37T(F/A0 2/3 /2a]} 

Notice that U and F are properly extensive and P and T intensive. 

4.2 k t ~ 1 = - F(9P/9F)j’ = F(9 2 F/9F 2 ) r 

C P - C v = - T(dPldT) v 2 /(dP/d V) T 

= J(9 2 F/9T9 F)/(9 2 F/9 F 2 ) t 
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4.3 C P = (&Q R /dT)p = T(dS/dT)p = ~T(d 2 GldT 2 ) P 


4.4 The energy of the system plus the reservoirs must be a minimum. Therefore 
d (U + U R ) = d U - T R dS - PdV = 0 

Therefore 


d (U - TS + PV) = dG = 0 

Since 

T = 7r, P = P R 

5.1 A = 3.66 x 10 -6 J mol -1 . C v = 23.8 J deg -1 mol -1 

at 300 K; 25.7 J deg- 1 mol -1 (800 K). 

Notice that thp correction to constant volume at room temperature is only 
about 3 per cent of the heat capacity. 


5.2 Equation 5.13 


AT =- 


a 

C v 




5.6 K (see exercise 3.1a) 


Only the term representing attractive forces between the molecules occurs in the 
result for free expansion. Although the cooling appears to be quite large it will 
in fact be greatly reduced due to the thermal mass of the container. 


5.3 We are required to show that 

(i dU/dP) T + d(PV)/bP - V - T(bV/bT) P 
The left-hand side may be written 

(bU/bP) T + P(bV/bP) T + V = {bUlbV) T (bPlbV) T ~ l + P(bV/bP) T + V 
on substituting for ( bU/bV) T using the energy equation the identity follows. 

RTb - a 

+--— (using the binomial 

^ expansion) 

- 2 a)/RT 

Therefore 

7} = 2 a/Rb 

This is the maximum inversion temperature (low-pressure limit). 


p = RT - !L - RT 
V-b V 2 ~V 

Therefore 

T(bV/bT)p - V « (RTb 
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5.5 



If the series was terminated with the terms in C P the inversion pressure would 
be given by 

P, = - (TdBpjdT - B P )I(T dCp/dT - C P ) 

= - [d(B P IT)ld(Cp!T)] 

This equation is adequate at high temperature but does not give a finite maxi¬ 
mum value of the pressure on the inversion curve (figure 5.2) because C P /T has 
a maximum value and at this temperature P\ goes to infinity. Using the van der 
Waals equation for example 

C P = [b 2 - (b - alRT?]/RT (exercise 2.2) 

= (2 T - T B )b 2 T B /RT 3 (T B = a/Rb = Boyle temperature) 

Then C P /T has a maximum value when T = 2T B /3 

(The van der Waals result for C P is not in good agreement with experiment but 
the general conclusion is correct. The term D P becomes important at low 
temperature.) 

5.6 The important parameter in the Joule - Kelvin expansion is ( dT/dP) H given 
by equation 5.19. In a reversible adiabatic expansion ( dT/dP) s must be con¬ 
sidered. 


(dT/dP) s = - (dTldS) P (dSlbP) T = T(dV/dT)p/C P 

Therefore 

(dT/dP)s ~ otTVICp > 0 (gas cools in isentropic expansion) 

(9 T/dP) H - (9P/9P) 5 = - V/Cp < 0 

The Joule - Kelvin expansion produces a smaller temperature drop than a 
reversible adiabatic expansion but involves no moving parts, which is an advant¬ 
age at low temperature. 

6.1 If the magnitude of v x is independent of v y and v z> the number of molecules 
with simultaneous components in the range v x to v x + dv x , v y to v y + dv y ,v z to 
v z + du z is simply the product function 

Nf(v x )f(v y )f(v z )dv x d v y dv z 

This product, by assumption, (a) must be a function of the magnitude of the 
velocity, that is to say of (v x 2 + v y 2 + v 2 ) 
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Therefore 


g(v 2 ) = g(v x 2 + v y 2 + v 2 ) = f(v x )f(vy)f(v 2 ) 

The solution is of the form 

f(v x ) = A exp (Bv x y, g(v 2 ) = A 3 exp (Bv 2 ) 

where A and B are constants. A is simply a normalising factor and B may be 
found by calculating the pressure exerted by the gas and relating it to the 
perfect gas law. The weakness of the proof (as Maxwell recognised) is that it 
cannot be assumed that the velocity components are independent. This was first 
proved by Boltzmann using the //-theorem. 

6.2 Substitute equation 6.1 into 6.3. Then 



The first integral is equal to (N/V) and the second to 3NkT/2Vso 

S = - kVH = |Nk In T + Nk In + constant 

in agreement with equation 3.40 once the identity of C v and 3Nk/2 has been 
established (exercise 4.1 or section 7.5). 

6.3 From equation A4.18. 

Ae = h 2 /2mL 2 = 2.4 x 1(T 33 J = 1.5 x 1(T 14 eV 

(see appendix IV). 

Then 


T x = A e/k = 1.7 x 1(T 10 K 

The equivalent temperature (« 10“ 14 K) for a hydrogen molecule is given 
simply by changing the mass. 

6.4 Total energy can only be an integer multiple of e 0 ; E = Me 0 where M is 
given by the sum of 

N 

M - n x + n 2 +...+«#= 2 n r 


and n r e 0 is the energy of the rth oscillator. The number of distinguishable 
microstates is the number of ways of bringing about M by changing the n r . 
This is equal to the number of ways of distributing M objects into N groups. 
Now N groups can be formed by using (N — 1) partitions between objects. 
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Total number of permutations is now (M + N — 1)! but we are not concerned 
with permutations between objects or partitions so 

W(E) = (M + N - \)\/M\(N - 1)! 

This type of calculation is discussed further in chapter 7. 

6.5 The probability of a particle being in the state of lowest energy for 
example is 

^ = (4 x 5) + (3 x 20) + (2 x 30) + (3 x 10) + (5 xl) 

Total number 

175 1 

350 2 

P(0) = 0.50, P(e) = 0.29, P(2e) = 0.14, P(3e) = 0.06, 
P( 4e) = 0.01 

The most probable arrangement occurs 30 times and within this arrangement 
P( 0) = 0.40, P(e) = 0.40, P(2e) = 0.20, P(3e) = 0, 

P( 4e) = 0 

The agreement between the two distributions is already quite good for five 
particles and becomes rapidly better as the number of particles increases. An 
extensive discussion of this type of calculation may be found in J. G. Powles, 
Particles (Addison-Wesley, London, 1968). 

6.6 W = (TV + n)\jN\n\\ E = ne 0 
Therefore 

S = k In W = k[(N + n)\n(N + n) — TV In TV —nlnn\ 

Now either 

(bSldE) v = 1 IT = (dSldn) v /e 0 

Then 

(dS/dn) v = e 0 IT = k[ In (TV + n) — In n] 
n = (TV + n) exp (— /3e 0 ) ~ TV exp (— j3e 0 ) 
or F = E ~ TS = ne 0 - k[(N + n) In (TV + n) — TV In TV - n In n] 

and, since F is a minimum at thermal equilibrium, (9 F/dn) v = 0. This type of 
imperfection in a crystal is called a Schottky defect. 

6.7 Distinguishable: g™ r = 9 Bosons: 4!/2!2! = 6. 

Fermions: 31/2111 = 3. 
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6.8 Helium e _a ~ 10~ s ^ 1 

Sodium one electron per atom. Lattice spacing about 0.3 nm. Hence 
e““ » 2 x 10 3 ; a » - 200. 

7.1 A/kT = 2n 0 H*/kT = 7 x 10~ 4 

for the nucleus. Therefore it is still in the high temperature limit (exercise 7.2). 
A/kT = 1.4 

for the ion, so the full expression for the thermal capacity is required. 

7.2 C> = [Nk/Pe 2 exp (J3e)] /[I + exp(/3e)] 2 
High-temperature limit: 

C v = (. Ne 2 l4k)T ~ 2 0Uo) 

In the helium region (4 K) the thermal capacity may increase due to this term 
while other contributions to the thermal capacity are decreasing (section 10.3). 

7.3 The partition function is given in equations 7.49 and 7.50. Then 

F=-kT\nZ,S = - (dF/dT) v = - (9F/9j3) K (dj3/dF) 

Therefore 

S/Nk = j3e o /[exp03e o ) - 1] - ln[l - exp(- j3e 0 )] 

7.4 Use equation 7.6 for the partition function and equation A4.33 for the 
density of single - particle states. Then 

Z = 


F = 

P = 

E = 

7.5 e n -- 

is the energy of the nth energy level and it is 2 n 2 degenerate. Therefore 

PJPi = Sn exp(- pe n )lgi exp(- fa) = n 2 ex p[0(e! ~ e n )] 


1 

N\ 


KK)£ 

/ s N 

N\ \ /z 3 c 3 / 


N 


e 2 e de 


[(fc7) 3 r(4)]* 


— kT\n Z = - NkT[\n(T 3 V + constant] 

— (9F/9 V) T = NkT/V, as for a Newtonian gas but 

F - T(dF/dT) v = 3 NkT = 3PV (compare with 
equation 7.67) 

— UR In 2 
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P 4 :P 3 :P 2 :P 1 = 2 x 1(T 10 :4 x 10“ 12 :2 x 10“ lo :l 
The upper levels are sparsely populated, even at 5000 K. 

7.6 Pj/Pq = (2J + 1) exp[— J(J + l)h 2 ISn 2 IkT) 

P 4 :P 3 :P 2 :P 1 :P 0 = 3 x 10“ 2 :0.2:0.9:1.6:1 

Notice that more molecules are in the first rotational energy level than in the 
ground state. 

For equal populations 

7 exp(— \2h 2 lSn 2 IkT) = 5 exp(- 6h 2 /Sn 2 IkT). 

Therefore 


T = 1.5 x 10 3 K 
8.1 1)3 = 0 (equation 8.1) 

v 3 = ^ v 5 exp(- mv 2 l2kT) d vj v 2 exp(— mv 2 /2kT)dv 

= %/(128A: 3 r 3 /7rw 3 ) 

(Substitute m\Pj2kT = x. The integrals then reduce to standard form as in 
appendix II.) v 3 may also be written in terms of IT (4kTv/m). 

8.3 The number of molecules striking unit area in unit time is rw/4 = Pv/4kT . 
Number of molecules to cover tmit area « 4lnd 2 . 

Time = \6kTlPviui 2 » 22 s 

To half - cover the surface at a pressure 10 3 lower, would take 22 x \ x 10 3 s 
«3h. A pressure of about 10“ 10 torr is therefore necessary if a surface is to 
remain clean during an experiment. 

8.4 The speed distribution function for the beam is of the form of equation 
8.23. Therefore 


v 2 = J V s exp(— mv 2 /2kT) d vj J v 3 exp(— mv 2 /2JcT) du 
= 4kT/m 

~e = mv 2 l2 = 2kT 

The mean energy of a molecule in the container is 3kT/2, but fast molecules are 
more likely to reach the aperture and go into the beam. 


8.5 In a time d t the number of molecules which leave the container through a 
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hole of area A is 

drti = — n x Vi A dr; n x = n x ° exp(- v x At) 

Similarly for molecules of type 2: 

n 2 = n 2 ° exp(- v 2 At) 

The concentration is 

n x l(rii + n 2 ) = «i 0 /[hi 0 + n 2 ° expfo - v 2 )At] 

complete separation is therefore only possible as t approaches infinity, that is 
as n x approaches zero. 

8.6 The number of electrons striking unit area of the surface in unit time with 
velocity in the x - direction greater than v 0 , where 0 = mv 0 2 1 2 is 

R = n(m/27rkT) l t 2 f v x exp(- mv 2 /2kT) dv x 

J Uq 

writing z = mv 2 /2kT and integrating 

R = n(kTl2ivri) l l 2 exp(- <p/kT) 

This is the first Richardson equation for the number of electrons emitted from 
a heated filament. It is not quite correct because the electrons must be treated 
by Fermi - Dirac statistics. The term T l l 2 then becomes T 2 but in fact the 
temperature dependence is dominated by the exponential term. 

9.1(a) Using equation 9.22, with a equal to 2.5 where Mis the molecular 
weight and C v is equal to 3R/2 for 1 mole. 

k = 14.5 x 10“ 2 W m -1 s" 1 deg" 1 

(b) r? = 0.5 nm vX = (O.SXPM/RW&RT/ttM) 1 ! 2 

Therefore 

X = 1.8 x 10" 7 m 

(c) Using equation 9.6: d = 2.2 x 10" 10 m (note that d < X < dimension 
of container) 

(d) ; X a TIP so using the result of (c) 

X = 2.2 x 10" 10 (76077273P) (Pintorr) 

Therefore 

P = 1.5 x 10" 2 torr (300 K) and 1.5 x 10" 3 torr (30 K) 

Therefore X » dimensions of the system even in a poor vacuum (compare 
exercise 8.3). 
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9.2 


Poiseuille flow 
Free molecular flow 


\6r\RTjavP = 1 677 / 01 ; p = 8 X/a 


The ratio is therefore equal to 8 X/ 0 , which is equal to 16 for a mean free 
path equal to the diameter. The observed mass flow is therefore an order of 
magnitude greater than would have been expected on the basis of Poiseuille 
flow. 


9.3 A change of the total mass of molecules of type 1 in the container of 
volume V leads to a change in pressure since 

A m = m x AN X = m x V AP x /kT 

where m x is the mass of a molecule of type 1. Then from equation 9.28 
= - aPi/y/m 

where a contains the parameters independent of the nature of the gas. 
Therefore 

Pi (0 = Pi ( 0 ) exp(- at/yjm) = ^( 0 ) exp(-r/r) 

Therefore 

t = a~ l \Jm 

The composition of a gas mixture in the system therefore changes with time 
since r is proportional to the square root of the mass of a molecule. 

9.4 The radiation loss from unit area in unit time is, from Stefan’s law 

Rr = a(T w 4 - r 4 ) ~ 4 oT\T w - T) 

using the binomial theorem. The heat loss through the gas is given by equation 
9.31 so the pressure at which R r = R% is 

^min = %oT 4 fv = 6.2 x 10~ 2 torr 

P m in is lowered by a factor equal to the emissivity of the wire if it does not 
behave like a black body. 

10.1 The Joule - Kelvin coefficient for a gas with the equation of state 

PV = RT + B P P is (dT/dP) H = [ T(dB P ldT) - B P ]IC P 
Equation 10.8 may be written for the Bose - Einstein gas in the form 
PV = RT( 1 - B v \V ...) 

Hence 

B v = - Nh 3 lg(2mnkT) 3 l 2 = B P (exercise 2.2) 

Therefore 

(dT/dP) H = 5Nh 3 l2C P g(2nmkTf 12 
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The right-hand side is always positive so a Bose - Einstein gas always cools in a 
Joule - Kelvin expansion. 

The expression for the Fermi - Dirac gas is the same apart from a negative 
sign and therefore always increases in temperature after a Joule - Kelvin expan¬ 
sion. (The values of C P are different for the two gases since 

C P = C F + T(bPlbT) v (bV/bT) P « C v + R + 2PdB P /dT 

and Cy = 3R/2.) 

The perfect quantum gases therefore behave like classical gases with attractive 
(Bose - Einstein) or repulsive (Fermi - Dirac) forces between the molecules 
(compare exercise 5.4). 


10.2 

Therefore 

10.3 


Therefore 


PV = 2E/3 = 2Ne/3 (equation 10.5) 

F = 3ep/5 ~ 3 eV (equation 10.15) 

P = (2ep x volume per electron)/5 = 19 x 10 4 atmospheres 

u = 4 oT*/c (equations 10.24 and 10.25) 

C rad = 16aT 3 V\c\ C gas = 3Nk\2 = 3PV/2T (constant 

volume) 

Qad/Cgas = 32oT 4 l3P * 6 X 10- 10 


10.4 As a function of frequency, using equation 10.39, (hv m3LX = 2.82 kT). 
Using c = *>A, dp = — c dX/X 2 and equation 10.36 

w(X,T)dX = 8ithc dX/X 5 [exp(fihc/X) - 1] 

( bu/b\) T = 0 for a maximum in the curve. 

Therefore 


Therefore 


Therefore 


he = 5fc7X max [l - exp(- hc/kTk)] 
hc/kTK max « 5 (actually 4.965) 
^max^max ^ 0.6c 


The difference arises because a system with a constant-frequency bandwidth 
allows a greater range of wavelengths to pass at low frequency 

(AX = cAp/v 2 ) 
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10.5 The density of states as a function of the magnitude of the wave vector k 
may be written in the form (equations A4.11, A4.16 and A4.23) 

D(k ) = constant x k!*~ l (n dimensions) 

In the low-temperature limit the dispersion relation is simply 

v = c s fK = c s kl2n 

where c s is the velocity of sound. 

Therefore 

D(v)dv = D(k)dk = constant x v n ~ 1 dv 
Substituting in equation 10.47 

E = constant x \ v n [exp(j3hv) - l]~~ l dv 
J o 

= constant x r (w+1) J^x n [exp(x) - l ] -1 dx 

and the integral is also a constant so 

E cc T^ n+1 ^',C v & T n (for n dimensions) 

\§.6D(k)dk = constant x k 2 dk as before but now 

D(v)dv = D(k)dk = constant x \/v dv (three dimensions) 
Using equation 10.47 and writing x = fihv 

E = constant x T 5 ^ 2 f x 3 t 2 [exp(x) — l ]" 1 dx 

Jo 

Therefore 

E oc T 5 I 2 \C V cx 7 13 / 2 

This magnetic contribution to the low-temperature specific heat has been 
observed in ferromagnetic insulators but is obscured by the electronic specific 
heat (proportional to T) in ferromagnetic metals. 

10.7 The calculation is identical with exercise 10.5 since the form of the dis¬ 
persion relation is the same. The magnetic specific heat of an antiferromagnet is 
therefore proportional to T 3 at low temperature. 

10.8 Using equation 10.113 

S$ = 0.58 N s k (when* = 1 ) 

The lattice entropy 

S t = \ 10 ~ s NkT 3 (at temperature T) 
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Therefore 

S s = S t (at 14 K) 

AS s = — 0.2 N s k Ax = 0.2N s kAT/T 2 (at* = 1) 

where 

AT = 7f — T 2> AS, = |l0 _s Nk(Tf 3 - 7\ 3 ) 

** - j 10“ 5 NkT x 3 AS s + ASi = 0 

Therefore 

AT/T 2 » 10 —3 7j 3 

AT = 2 T 2 when 7\ = 10 K. AT = 10~ 3 T 2 when 7) = IK. 

10.9 Cfj = T(dSldT)fj* (by definition) 

Therefore 

C„ = - T(dS/dH*) T (dH*ldT) s 


(M*/dT) s = QH*l*njr 


Therefore 


(for a simple paramagnet, equation 10.108) 


C„ = - n 0 T(dJ?ldT) H .(dH*/bT)jf (using equation 10.106) 


Therefore 

C„ = ix 0 CH* 2 /T 2 
Using equation 10.110 

CdT r 

c h 

Therefore 

ln(r 2 /r,) = In (HM) 
H 2 /T 2 = HJT , 
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